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Hepatitis C Virus Cell Culture System 



5 ABSTRACT 

The hepatitis C virus (HCV) cell culture system according 
to the invention consists of human hepatoma cells, which 
are transfected with a HCV-RNA construct, that comprises 
10 the HCV specific RNA segments 5' NTR, NS3, NS4A, NS4B, 
NS5A, NS5B, and 3 1 NTR as well as a minimum of one marker 
gene for selection (selection gene) . 
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Hepatitis C Virus Cell Culture System 

DESCRIPTION 

5 The invention relates to a hepatitis C virus (HCV) cell 
culture system, which comprises mainly eukaryotic cells 
containing transfected HCV specific genetic material, which 
means they are transfected with HCV specific genetic 
material. 

The hepatitis C virus (HCV) is one of the main causes 
worldwide of chronic and sporadic liver diseases. The 
history of most HCV infections does not involve any obvious 
clinical signs, but 80 - 90 % of the infected people become 

15 chronic carriers of the virus and 50 % of these chronic 
carriers of the virus develop chronic hepatitis with 
different degrees of severity. Approx. 20 % of the 
chronically infected develop a cirrhosis of the liver over 
10 to 20 years, based on what a primary hepatocellular 

20 carcinoma can develop. Nowadays chronic hepatitis C is the 
main indication for liver transplantation. A specific 
therapy does not exist until now. The only therapy 
currently available is high-dose administration of 
Interferon alpha or a combination of Interferon alpha and 

25 the purine nucleoside analogue Ribavirin. However, only 
approx. 60 % of all. treated persons respond to this therapy 
and with these, a new viraemia occurs in more than half of 
all cases after the discontinuation of the treatment. 

30 Due to the high prevalence, especially in industrialized 
countries, the serious effects of chronic infections and 
the non-existence of a specific therapy, the development of 
a HCV specific chemotherapy is an important goal of 
pharmaceutical research and development. The main problem 

35 lies in the previous lack of a suitable cell culture 
system, which enables the study of virus replication and 
pathogenesis in eukaryotic cells. 
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Due to the small amount of virus in blood or tissue, the 
lack of suitable cell culture systems or animal models (the 
chimpanzee is still the only possible experimental animal) 
5 as well as the lack of efficient systems for producing 
virus-like particles, it was not possible up to now, to 
analyze the molecular composition of the HCV particle in- 
depth and to solve it. The information currently available 
can be summarized as follows: HCV is an enveloped plus- 

10 strand RNA virus with a particle diameter of 50 - 60 nm and 
a medium density of 1.03 - 1.1 g/ml. It was molecularly 
cloned and characterized for the first time in 1989 (Choo 
et al., 1939, Science, 244, 359 - 362). The HCV-RNA has a 
length of approx. 9.6 kb (= 9600 nucleotides), a positive 

15 polarity and comprises one open reading frame (ORF) , which 
encodes a linear polyprotein of approx. 3010 amino acids 
(see Rice 1996, in Virology, B. N. Fields, D. M. Knipe, P. 
M. Howley, Eds. (Lippincott-Raven, Philadelphia, PA, 1996), 
vol. 1, pp. 931 - 960; Clarke 1997, J. Gen. Virol. 78, 

20 2397; and Bartenschlager 1997, In tervlrology 40, 378 and 
see Fig. 1 A). During the replication of the virus the 
polyprotein is cleaved into the mature and functionally 
active proteins by cellular and viral proteases. 

25 Within the polyprotein the proteins are arranged as follows 
(from the amino- to. the carboxy terminus) : Core-El-E2-p7- 
NS2-NS3-NS4A-NS4B-NS5A-NS5B. The core protein is the main 
component of the nucleocapsid . The glycoproteins El and E2 
are transmembrane proteins and the main components of the 

30 viral envelope. They probably play an important role during 
the ^ttachement of the virus to the host cell. These three 
proteins core, El, and E2 constitute the viral particle and 
are therefore called structural proteins. The function of 
the protein p7 is still not clear. The protein NS2 is 

35 probably the catalytic domain of the NS2-3 protease, which 
is responsible for the processing between the proteins NS2 
and NS3. The protein NS3 has two functions, one is a 
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protease activity in the amino terminal domain, which is 
essential for the polyprotein processing, and the other a 
NTPase/helicase function in the carboxy terminal domain, 
which is probably important during the replication of the 
5 viral RNA. The protein NS4A is a co-factor of the NS3 
protease. The function of the protein NS4B is unknown. 

The open reading frame is flanked on its 5' end by a non- 
translated region (NTR) approx. 340 nucleotides in length, 

10 which functions as the internal ribosome entry site (IRES), 
and on its 3 1 end by a NTR approx. 230 nucleotides in 
length, which is most likely important for the genome 
replication. A 3 r NTR such as this is the object of patent 
application PCT/US 96/14033. The structural proteins in the 

L5 amino terminal quarter of the polyprotein are cleaved by 
host cell signal peptidase. The non-structural proteins 
(NS) 2 to (NS) 5B are processed by two viral enzymes, 
namely the NS2-3 and the NS3/4A protease. The NS3/4A 
protease is required for all cleavages beyond the carboxy 

20 terminus of NS3. The function of NS4B is unknown. NS5A, a 
highly phosphorylated protein, seems to be responsible for 
the Interferon resistance of various HCV genotypes (see 
Enomoto et al. 1995, J. Clin. Invest. 96, 224; Enomoto et 
al. 1996, N. Engl. J. Med. 334, 77; Gale Jr. et al. 1997, 

25 Virology 230, 217; Kaneko et al. 1994, Biochem. Biophys. 
Res. Commun. 205, .320; Reed et al., 1997, J. Virol. 71, 
7187), and NS5B has been identified as the RNA-dependent 
RNA polymerase. 

30 First diagnostic systems have been developed from these 
findings, which are either based on the detection of HCV 
specific antibodies in patient serum or the detection of 
HCV specific RNA using the reverse transcription polymerase 
chain reaction (RT-PCR) , and which are (must be) routinely 

35 used with all blood and blood products and/or according to 
the regulations. 
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Since the first description' of the genome in 1989 several 
partial and complete sequences of the HCV have been cloned 
and characterized using the PCR method. A comparison of 
these sequences shows a high variability of the viral 
5 genome in particular in the area of the NS5B gene, which 
eventually resulted in the classification of 6 genotypes, 
which are again subdivided into the subtypes a, b, and c. 

The genomic variance is not evenly distributed over the 
10 genome. The 5 1 NTR and parts of the 3'NTR are highly 
conserved, while certain encoded sequences vary a lot, in 
particular the envelope proteins El and E2. 

The cloned and characterized partial and complete sequences 
15 of the HCV genome have also been analyzed with regard to 
appropriate targets for a prospective antiviral therapy. In 
the course of this, three viral enzymes have been 
discovered, which may provide a possible target. These 
include (1) the NS3/4A protease complex, (2) the NS3 
20 Helicase and (3) the NS5B RNA-dependent RNA polymerase. The 
NS3/4A protease complex and the NS3 Helicase have already 
been crystallized and their three-dimensional structure 
determined (Kim et al., 1996, Cell, 87,343; Yem et al. f 
1998, Protein Science, 7, 837; Love et al., 1996, Cell, 87, 
25 311; Kim et al . , 1998, Structure, 6, 89; Yao et al., 1997, 
Nature Structural Biology, 4, 463, Cho et al., 1998, J. 
Biol. Chem., 273, 15045). it has not been successful until 
now with the NS5B RNA-dependent RNA polymerase.. 

30 Even though important targets for the development of a 
therapy for chronic HCV infection have been defined with 
these enzymes and even though a worldwide intensive search 
.for suitable inhibitors is ongoing with the aid of rational 
drug design as well as high throughput screening, the 

35 development of a therapy has one major deficiency, namely 
the lack of cell culture systems or simple animal models, 
which allow direct, reliable identification of HCV-RNA or 
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HCV antigens with simple methods which are common in the 
laboratory. The lack of these cell culture systems is also 
the main reason that to date the comprehension of HCV 
replication is still incomplete and mainly hypothetical. 

5 

Although according to the experts, a close evolutionary 
relationship exists between HCV and the flavi- and 
pestiviruses, and self -replicating RNAs have been described 
for these, which can be used for the replication in 
10 different cell lines with a relatively high yield, (see 
Khromykh et al., 1997, J. Virol. 71, 1497; Behrens et al., 
1998, J. Virol. 72, 2364; Moser et al., 1998, J. Virol. 72, 
5318), similar experiments with HCV have not been 
successful to date. 

15 

Although it is known from different publications that cell 
lines or primary cell cultures can be infected with high 
titre patient serum containing HCV, (Lanford et al. 1994, 
Virology 202, 606; Shimizu et al. 1993, Proceedings of the 

20 National Academy of Sciences, USA, 90, 6037 - 6041; 
Mizutani et al. 1996, Journal of Virology, 70, 7219 - 7223; 
M. Ikeda et al. 1998, Virus Res. 56, 157; Fournier et al. 
1998, <7. Gen. Virol. 79, 2376 and bibliographical 
references quoted in here; Ito et al . 1996, Journal of 

25 General Virology, 77, 1043 - 1054), these virus-infected 
cell lines or cell cultures do not allow the direct 
detection of HCV-RNA or HCV antigens. The viral RNA in 
these cells can not be detected in a Northern Blot (a 
standard method for the quantitative detection of RNA) or 

30 the viral protein in a Western Blot or with 
immunoprecipitation. It has only been possible to detect 
HCV replication with very costly and indirect methods. 
These disadvantageous facts show that obviously the 
replication in these known virus-infected cell lines or 

35 cell cultures is completely insufficient. 



CA 02303526 2000-03-31 

- 6 - 



Furthermore it is known from the publications of Yoo et al. 
(1995, Journal of Virology, 69, 32 - 38) and of Dash et 
al., (1997, American Journal of Pathology, 151, 363 - 373) 
that hepatoma cell lines can be transfected with synthetic 
5 HCV-RNA, which are obtained through in vitro transcription 
of the cloned HCV genome. In both publications the authors 
started from the basic idea that the viral HCV genome is a 
plus-strand RNA functioning directly as mRNA after being 
transfected into the cell, permitting the synthesis of 

10 viral proteins in the course of the translation process, 
and so new HCV particles are (could be) formed. This viral 
replication, which means these newly formed HCV viruses and 
their RNA, have been detected through RT-PCR. However the 
published results of the RT-PCR carried out indicate, that 

15 the HCV replication in the described HCV transfected 
hepatoma cells is not particularly efficient and is not 
sufficient to measure the quality, let alone the quantity 
of the fluctuations in the replication rate after an 
targeted action with prospective antiviral treatments. 

20 Furthermore it is prior art (Yanagi et al., Proc. Natl. 
Acad. Sci. USA, 96, 2291-95, 1999), that the highly 
conserved 3' NTR is essential for the virus replication. 
This knowledge strictly contradicts the statements of Yoo 
et al. and Dash et al., who used for their experiments only 

25 HCV genomes with shorter 3 1 NTRs since they did not know 
the authentic 3 f end of the HCV genome. 

The object of the present invention is to provide a HCV 
cell culture system, where the viral RNA self-replicates in 
30 the transfected cells with such a high efficiency that the 
quality and quantity of the fluctuations in the replication 
rate can be measured with common methodologies usually 
found in the laboratory after a targeted action with virus 
and prospective HCV specific antivirals in particular. 

35 

The solution to this problem is to provide a cell culture 
system of the prior mentioned type, where the eukaryotic 
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cells are human cells, in particular hepatoma cells, which 

are preferably derived from a normal hepatoma cell line, 

but can also be obtained from an appropriate primary cell 

culture, and where the transfected HCV specific genetic 

5 material is a HCV-RNA construct, which essentially 

comprises the HCV specific RNA segments 5 1 NTR, NS3, NS4A, 

> 

NS4B, NS5A, NS5B, and 3' NTR preferably in the order 
mentioned as well as a minimum of one marker gene for 
selection (selection gene) . 

10 

Here and in the following "NTR" stands for "non-translated 
region" and is a known and familiar term or abbreviation to 
the relevant expert. 

15 Here and in the following the term "HCV-RNA construct" 
comprises constructs, which include the complete HCV 
genome, as well as those, which only include a part of it, 
which means a HCV subgenome. 

20 A preferred variation of the cell culture system according 
to the invention, which had proven to be worthwhile in 
practice, is lodged at the DSMZ, Deutsche Sammlung von 
Mikroorganismen und Zellkulturen GmbH (German collection of 
Microorganisms and Cell Cultures) in Braunschweig, Germany 

25 under the number DSM ACC2394 (laboratory name HuBl 9-13). 

With the cell culture system according to the invention an 
in vitro system is provided for the first time, where HCV- 
RNA is self-replicated and expressed intracellularly and in 

30 a sufficient amount, so that the quantity of the amounts of 
HCV- RNA as well as the HCV specific proteins can be 
determined with conventional and reliably precise 
biochemical measuring methods. This means an almost 
authentic cell-based HCV replication system is available 

35 for the first time, which is urgently needed for the 
development and testing of antiviral drugs. This test 
system provides the possibility of identifying potential 
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targets for an effective HCV specific therapy and 
developing and evaluating HCV specific chemotherapeuticals . 

The invention is based on the surprising finding that 
5 efficient replication of the HCV-RNA only occurs in cells 
if they have been transfected with an HCV-RNA construct, 
which comprises at least the 5 f and the 3' non-translated 
regions (NTR) and the non-structural proteins (NS) 3 to 5B 
and additionally a marker gene for selection (selection 

10 gene) . The structural genes are obviously without great 
importance for replication, whereas efficient replication 
of the HCV-RNA apparently only occurs if the transfected 
cells are subject to permanent selection pressure, which is 
imparted by the marker gene for selection (selection gene) 

15 linked to the HCV-RNA. Consequently the marker gene 
(selection gene) seems on one hand to provoke the selection 
of those cells, where the HCV-RNA replicates productively, 
and it seems on the other hand to considerably increase the 
efficiency of the RNA replication. 

20 

An object of the invention is also a cell-free HCV-RNA 
construct, characterized in that it comprises the HCV 
specific RNA segments 5' NTR, NS3, NS4A, NS4B, NS5A, NS5B,. 
and 3 ! NTR, preferably in the order mentioned, as well as a 
25 marker gene for selection (selection gene) . 

In the present context the terms 5' NTR and NS3 and NS4A 
and NS4B and NS5A and NS5B and 3* NTR comprise each 
nucleotide sequence, which is described in the state of the 
30 art as the nucleotide sequence for each functional segment 
of the HCV genome. 

By providing a HCV-RNA construct such as this, a detailed 
analysis of the HCV replication, pathogenesis and evolution 
35 in cell culture is possible for the first time. The HCV 
specific viral RNA can specifically be created as a 
complete genome or subgenome in any amount, and it is 
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possible, to manipulate the "RNA construct and consequently 
to examine and identify the HCV functions on a genetic 
level. 

5 Because all HCV enzymes identified as a main target for a 
therapy at the moment, namely the NS3/4A protease, the NS3 
helicase and the NS5B polymerase, are included in the HCV- 
RNA construct according to the invention, it can be used 
for all relevant analyses. 

10 

An embodiment of the HCV- RNA construct, which has proven to 
be worthwhile in practical use, stands out by the fact that 
it comprises the nucleotide sequence according to the 
sequence protocol SEQ ID NO:l. 

15 

Further embodiments with similar good properties for 
practical use are characterized in that they comprise a 
nucleotide sequence either according to sequence protocol 
SEQ ID NO: 2 or according to sequence protocol SEQ ID NO: 3 

20 or according to sequence protocol SEQ ID NO: 4 or according 
to sequence protocol SEQ ID NO: 5 or according to sequence 
protocol SEQ ID NO: 6 or according to sequence protocol SEQ 
ID NO: 7 or according to sequence protocol SEQ ID NO: 8 or 
according to sequence protocol SEQ ID NO: 9 or according to 

25 sequence protocol SEQ ID NO: 10 or according to sequence 
protocol SEQ ID NO: 11. 

It is possible to provide the HCV subgenomic construct with 
a 3* NTR, which has a nucleotide sequence so far unknown in 
30 the state of art, a nucleotide sequence, which has been 
selected from the group of nucleotide sequences (a) to (i) 
listed in the following: 



( a ) ACGGGGAGCTAAACACTCCAGGCCAAT AGGCCATCCTGTTTTTTTTTTTAGCTTT 
35 TTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCCCAGACTGGAG 
T 
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(b) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTAGTCT 
TTTTTTTTTC TTTTTTTTGA GAGAGAGAGT CTCACTCTGT TGCCCAGACT 
GGAGC 

(c) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTAATCTTT 
TTTTTTTTCT TTTTTTTTGA GAGAGAGAGT CTCACTCTGT TGCCCAGACT 
GCAGC 

(d) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTTAGTC 
TTTTTTTTTT TCTTTTTTTT TGAGAGAGAG AGTCTCACTC TGTTGCCCAG 
ACTGGAGT 

(e) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTAGTCT 
TTTTTTTTTT TCTTTTTTTT TGAGAGAGAG AGTCTCACTC TGTTGCCCAG 
ACTGGAGT 

( f ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTAGTCT 
TTTTTTTTTT TCTTTTTTTT TTGAGAGAGA GAGTCTCACT CTGTTGCCCA 
GACTGGAGT 

(g) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTAGTCT 
TTTTTTTTTT CTTTTTTTTT G A GAG A GAGA GTCTCACTCT GTTGCCCAGA 
CTGGAGT 

(h) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTTTAAT 

CTTTTTTTTT TTTTTCCTTT TTTTGAGAGA GAGAGTCTCA CTCTGTTGCC 

-<»- 

CAGACTGGAG T 

( i ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTTAATC 
TTTTTTTTTT TTTTCTTTTT TTTTTGAGAG AGAGAGTCTC ACTCTGTTGC 
CCAGACTGGA GT 

The marker gene for selection (selection gene) included in 
the HCV-RNA constructs according to the invention is 
preferably a resistance gene, in particular an antibiotic 
resistance gene . 
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This has the advantage that the cells transfected with this 
construct can easily be selected from the non-transfected 
cells by adding for example the appropriate antibiotic to 
the cell culture medium in the case of an antibiotic 
5 resistance gene. 

In the present context 'antibiotic 1 means any substance, 
which impedes the non-transf ected host cells or the cells, 
where the HCV-RNA is not replicating efficiently, 
10 continuing to live or grow, especially the cell poison 
Puromycin, Hygromycin, Zeocin, Bleomycin or Blasticidin. 

A preferred marker gene for selection (selection gene) and 
resistance gene, which has proven to be worthwhile in 
15 practice, is the neomycin phosphotransferase gene. 

An alternative for the antibiotic resistance genes is for 
example the thymidine kinase gene, which can be used to 
carry out a HAT selection. 

20 

The marker gene for selection (selection gene), the 
preferred resistance gene and the most preferred antibiotic 
resistance gene is preferably positioned in the HCV-RNA 
construct after the HCV 5 1 NTR, which means downstream from 
25 the 5 1 NTR and upstream from the HCV reading frame. 

However, an insertion in the area of the 3 f NTR or another 
site of the HCV genome or subgenome, for example within the 
polyprotein, is also conceivable . 

30 In another embodiment of the HCV-RNA construct according to 
the invention the marker gene for selection (selection 
gene) , in particular an antibiotic resistance gene, is 
linked to the HCV-RNA or HCV genomic or subgenomic sequence 
via a ribozyme or a recognition site for a ribozyme. 

35 

This has the advantage, that after the selection of the 
cells, in which the HCV-RNA is replicating productively, 
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the resistance gene in the obtained cell clones can be 
separated from the HCV subgenomic sequence through a 
ribozyme-dependent cleavage, namely by activating the 
inserted ribozyme or in the case of a construct with a 
5 recognition site for a ribozyme, by transfecting the 
ribozyme into the cells (for example through the 
transfection of a ribozyme construct or infection with a 
viral expression vector, into which the appropriate 
ribozyme has been inserted) . By this means an authentic HCV 
10 genomic construct can be obtained without a resistance 
gene, which can then form authentic infectious virus 
particles. 

Another preferred embodiment of the HCV-RNA construct 
15 according to the invention is characterized in that the 
construct has at least one integrated reporter gene. 

In the following a reporter gene means any gene, whose 
presence can be easily detected with, in general, simple 
20 biochemical or also histochemical methods after being 
transferred into a target organism, which means a gene, 
that encodes for a protein, which can be easily and 
reliably detected and quantified in small amounts with the 
common measuring methods in the laboratory. 

25 

This variation of the HCV-RNA construct has the advantage 
that the extent of the replication of this construct can be 
easily and quickly measured with the methods ' common in the 
laboratory using the reporter gene product. 

30 

The reporter gene is preferably a gene from the group of 
the lucif erase genes, the CAT gene (chloramphenicol acetyl 
transferase gene), the lacZ gene (beta galactosidase gene) , 
the GFP gene (green fluorescence protein gene), the GUS 
35 gene (glucuronidase gene) or the SEAP gene (secreted 
alkaline phosphatase gene) . This reporter gene and its 
products, namely the relevant reporter proteins, can be 
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detected for example using fluorescence, chemiluminescence, 
colorimetric measurements or by means of immunological 
methods (for example ELISA) . 

5 A surrogate marker gene can also be considered as a 
reporter gene. In this context it includes those genes, 
which encode for cellular proteins, nucleic acids or 
generally for those functions, which are subject to 
variation depending on the replication of the virus, and 
10 which consequently are either suppressed or activated in 
the cells, in which the HCV or the HCV-RNA construct 
multiplies. This means, the suppression or activation of 
this function is a surrogate marker for the replication of 
the virus or the replication of the HCV-RNA construct. 

15 

The positions of the reporter genes and the marker gene for 
selection (selection gene) can be selected in such a way, 
that a fusion protein made from both genetic products will 
be expressed. This has the advantage that these two genes 

20 can be arranged in such a way in the HCV-RNA construct that 
their two expressed proteins are fused via a recognition 
sequence for a protease (for example ubiquitin) or via a 
self-cleaving peptide (for example the 2A protein of the 
Picornaviruses) at first and will be separated 

25 proteolytically later. 

These two positions might as well lie apart from each other 
in such way, that both genetic products are separately 
expressed (for example in the order: marker or resistance 
30 gene — internal ribosome binding site — reporter gene) . 

In the case of the reporter gene one embodiment has proven 
to be particularly worthwhile, where the reporter gene is 
cloned into the open reading frame of the HCV genome or 
35 subgenome in such a way that it will only be transferred to 
an active form after proteolytic processing. 
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The cell culture system according to the invention can be 
used for various purposes in each of its embodiments- These 
comprise: 

• The detection of antiviral substances. This can include 
for example: organic compounds, which interfere directly 
or indirectly with viral growth (for example inhibitors 
of the viral proteases, the NS3 helicase, the NS5B RNA- 
dependent RNA polymerase) , antisense oligonucleotides, 
which will hybridize to any target sequence in the HCV- 
RNA construct (for example the 5' NTR) and will have an 
direct or indirect influence on the virus growth for 
example due to a reduction of translation of the HCV 
polyprotein or ribozymes, which cleave any HCV- RNA 
sequence and consequently impair virus replication. 

• The evaluation of any type of antiviral substances in 
the cell culture. These substances can be detected on 
the isolated purified enzyme for example with 'rational 
drug design' or 'high-throughput screening'. Evaluation 
means mainly the determination of the inhibitory 
features of the respective substance as well as its mode 
of action. 

• The identification of new targets of viral or cellular 
origin for a HCV specific antiviral therapy. If for 
example a cellular protein is essential for viral 
replication, the viral replication can also be 
influenced by inhibiting this cellular protein. The 
system according to the invention also enables the 
detection of these auxiliary factors. 

• The determination of drug resistance. It can be assumed 
that resistance to therapy occurs due to the high 
mutation rate of the HCV genome. This resistance, which 
is very important for the clinical approval of a 
substance, can be detected with the cell culture system 
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according to the invention. Cell lines, in which the 
HCV-RNA construct or the HCV genome or subgenome 
replicates, are incubated with increasing concentrations 
of the relevant substance and the replication of the 
5 viral RNA is either determined by means of an introduced 

reporter gene or through the qualitative or quantitative 
detection of the viral nucleic acids or proteins. 
Resistance is given if no or a reduced inhibition of the 
replication can be observed with the normal 
10 concentration of the active substance. The nucleotide 

and amino acid replacements responsible for the therapy 
resistance can be determined by recloning the HCV-RNA 
(for example by the means of RT-PCR) and sequence 
analysis. By cloning the relevant replacement/s into the 
15 original construct its causality for the resistance to 

therapy can be proven. 

The production of authentic virus proteins (antigens) 
for the development and/or evaluation of diagnostics. 
The cell culture system according to the invention also 
allows the expression of HCV antigens in cell cultures. 
In principle these antigens can be used as the basis for 
diagnostic detection methods. 

The production of HCV viruses and virus-like particles, 
in particular for the development or production of 
therapeutics and vaccines as well as for diagnostic 
purposes. Especially cell culture adapted complete HCV 
genomes, which could be produced by using the cell 
cultur system according to the invention, are able to 
replicate in cell culture with high efficiency. . These 
genomes have the complete functions of HCV and in 
consequence they are able to produce infectious. viruses. 

35 The HCV- RNA construct according to the invention by itself 
can also be used for various purposes in all its 
embodiments. This includes first of all: 
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The construction of attenuated hepatitis C viruses or 
HCV-like particles and their production in cell 
cultures : 

Attenuated HCV or HCV-like particles can be created by 
accidental or purposefully introduced mutations, such as 
point mutations, deletions or insertions, which means 
viruses or virus-like particles with complete ability to 
replicate, but reduced or missing pathogenicity. These 
attenuated HCV or HCV-like particles can be used in 
particular as vaccine. 

The construction of HCV-RNA constructs with integrated 
foreign genes, used for example as liver cell specific 
vector in gene therapy. Due to the distinctive liver 
cell tropism of the HCV and the possibility of replacing 
parts of the genome by heterologous sequences, HCV-RNA 
constructs can be produced, where for example the 
structural proteins can be replaced by a therapeutically 
effective gene. The HCV-RNA construct obtained in this 
way is introduced into cells preferably by means of 
transf ection, which express the missing HCV functions, 
for example the structural proteins, in a constitutive 
or inducible way. Virus particles, carrying the HCV-RNA 
construct, can be created by means of this method known 
to the expert under the term ' transcomplementation 1 . The 
particles obtained can preferably be used for the 
infection of liver cells. Within these the 
therapeutically effective foreign gene will be expressed 
and will consequently develop its therapeutic effect. 

The detection of permissive cells, which means cells, in 
which a productive virus growth occurs. For this purpose 
either one of the HCV-RNA genomic constructs previously 
mentioned, which is able to form complete infectious 
viruses, or one of the HCV subgenomes previously 
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mentioned, which according to the previously mentioned 
example will be transfected in a cell line first, which 
expresses the missing functions in a constitutive or 
inducible way, is used. In each case virus particles are 
5 created, which carry a resistance and/or reporter gene 

apart from the HCV sequence. In order to detect cells, 
where the HCV is able to replicate, these cells are 
infected with viruses generated in this way and subject 
to an antibiotic selection or they are examined 

10 depending on the HCV-RNA construct by means of 

determining the presence of the expression of the 
reporter gene. Because an antibiotic resistance or 
reporter gene expression can only be established, when 
HCV-RNA construct replicates, the cells detected in this 

15 way must be permissive. Almost any cell line or primary 

cell culture can be tested in regard to the permissivity 
and detected in this way. 

20 The cell culture system according to the invention also 
permits targeted discovery of HCV-RNA constructs for which 
there is an increase in the efficiency of replication due 
to mutations. This . occurs either by chance, in the context 
of HCV-RNA replication, or by targeted introduction into 

25 the construct. These mutations, leading to a change in the 
replication of the HCV-RNA construct, are known to experts 
as adaptive mutations.*' the invention therefore also 
includes a method for obtaining cell culture adapted 
mutants of a HCV-RNA construct according to the invention 

30 following the above description, in which the mutants have 
increased replication efficiency compared to the original 
HCV-RNA construct. It further includes a method for the 
production of mutants of a HCV-RNA full-length genome or of 
a HCV-RNA subgenome or of any HCV-RNA construct with 

35 increased replication efficiency compared to the original 
HCV-RNA full-length genome or subgenome or HCV-RNA 
construct, as well as cell culture adapted mutants of HCV- 
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RNA constructs, HCV-RNA " full-length genomes and HCV 
subgenomes with increased replication efficiency compared 
to the original constructs, subgenomes or full-length 
genomes. 

5 

The method according to the invention for. the production of 
cell culture adapted mutants of a HCV-RNA construct 
according to the invention, in which the mutants have 
increased replication efficiency compared to the HCV-RNA 

10 construct, is characterised in that a cell culture system 
according to claim 1, in which the transfected HCV specific 
genetic material is a HCV-RNA construct with a selection 
gene according to claims 4 to 19, is cultivated on/in the 
selection medium corresponding to the selection gene, that 

15 the cultivated cell clones are collected and that the HCV- 
RNA construct is isolated from these cell clones. 

In an advantageous extension of this production method, the 
isolated HCV-RNA constructs are passaged at least one more 

20 time, that is they are transfected in cells of a cell 
culture system according to claim 1, the thus obtained cell 
culture system according to claim 1, in which the 
transfected HCV specific genetic material is the isolated 
HCV-RNA construct with a selection gene, is cultivated 

25 on/in the selection medium corresponding to the selection 
gene, the cultivated cell clones are collected and the HCV- 
RNA constructs are thus isolated. 

Using this process variation, the quantity of adaptive 
30 mutations and hence the degree of replication efficiency in 
the relevant HCV-RNA constructs can be increased even 
further. 

The method according to the invention for the production of 
35 mutants of a HCV-RNA full-length genome or of a HCV-RNA 
subgenome or of any HCV-RNA construct with increased 
replication efficiency compared to the original HCV-RNA 
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full-length genome or subgenome or HCV-RNA construct, has 
the following features. Using one of the two production 
methods presented above, a cell culture adapted mutant of a 
HCV-RNA construct is produced, isolated from the cells, 
5 cloned using state of the art known methods and sequenced. 
By comparing with the nucleotide and amino acid sequence of 
the original HCV-RNA construct, the type, number and 
position of the mutations is determined. These mutations 
are then introduced into an (isolated) HCV subgenome or 
10 full-length genome or any HCV-RNA construct, either by 
site-directed mutagenesis, or by exchange of DNA fragments 
containing the relevant mutations. 

A test can be carried out to determine or verify which 
15 mutations actually are responsible for an alteration of 
replication efficiency, particularly an increase in 
replication. In this test the corresponding nucleotide 
and/or amino acid changes are introduced into the original 
HCV-RNA construct and the modified construct is then 
20 transfected in cell culture. If the introduced mutation 
actually leads to an increase in replication, then for a 
HCV-RNA construct with a selectable marker gene, the number 
of resistant cell clones in the artificially mutated 
construct should be noticeably higher compared to the 
25 untreated construct. 

In the case of a construct with a reporter gene, the 
activity or quantity of the reporter should be noticeably 
higher for the artificially mutated construct compared to 
30 the untreated one. 

The cell culture adapted HCV-RNA constructs with high 
replication efficiency according to the invention are 
characterized in that, through nucleotide or amino acid 
35 exchanges, they are derivable from a HCV-RNA construct 
according to one of the claims 4 to 19, and that they are 
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obtainable using one of - the two production processes 
presented above. 

These cell culture adapted HCV-RNA constructs can be used 
to produce any HCV-RNA constructs or HCV full-length or 
subgenomes with increased replication efficiency. Both 
constructs with a selectable resistance gene and constructs 
without one or with a non-selectable reporter gene (e.g. 
luciferase} can be produced in this way, since replication 
of cell culture adapted HCV-RNA constructs can also be 
demonstrated in non-selected cells due to their high 
replication efficiency. 

The cell culture adapted mutants of a HCV-RNA construct or 
HCV-RNA full-length genome or HCV subgenome with high 
replication efficiency compared to the original HCV-RNA 
construct or the original HCV full-length genome are 
characterized in that they are obtainable by a method in 
which the type and number of mutations in a cell culture 
adapted HCV-RNA construct are determined through sequence 
analysis and sequence comparison and these mutations are 
introduced into a HCV-RNA construct, particularly a HCV-RNA 
construct according to one of the claims 4 to 19, or into 
an (isolated) HCV-RNA full-length genome, either by site- 
directed mutagenesis, or by exchange of DNA fragments 
containing the relevant mutations. 

A group of preferred HCV-RNA constructs, HCV full-length 
genomes . and HCV subgenomes with high and very high 
replication efficiency, which are consequently highly 
suitable for practical use is characterised in that it 
contains one, several or all of the amino acid or nucleic 
acid exchanges listed in table 3 and/or one or several of 
the following amino acid exchanges: 1283 arg -> gly , 1383 
glu -> ala , 1577 lys -> arg , 1609 lys -> glu , 1936 
pro -> ser , 2163 glu -> gly , 2330 lys -> glu , 2442 ile 
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-> val. (The numbers refer to the amino acid positions of 
the polyprotein of the HCV isolate conl, see Table 1) . 

5 Special features of the nucleotid sequences according to the sequence listings: 

SEQ ID-NO: 1 

Name: I38 9/Core-3 1 /wt 
Composition (Nucleotide positions) : 
1.1-341: HCV 5' non-translated region 
10 2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
3 • 1202-1812 : internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 

downstream located HCV open reading frame 
15 4.1813-10842: HCV Polyprotein from Core up to nonstructural 

protein 5B 

5. 1813-2385: HCV Core Protein; structural protein 

6. 2386-2961: envelope protein 1 (El); structural 
protein 

20 1. 2962-4050: envelope protein 2 (E2) ; structural 

protein 

8. 4051-4239: Protein p7 

9. 4240-4890: nonstructural protein 2 (NS2) ; HCV NS2-3 
Protease 

25 10.4891-6783: nonstructural protein 3 (NS3) ; HCV NS3 

Protease /Helicase 
11.6784-6945: nonstructural protein 4A (NS4A) ; NS3 

Protease cofactor 
12.6946-7728: nonstructural protein 4B (NS4B) 
30 13.7729-9069: nonstructural protein 5A (NS5A) 

14.9070-10842: nonstructural protein 5B (NS5B) ; RNA- 

dependent RNA-polymerase 
15.10846-11076: HCV 3' non-translated region 
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SEQ ID-NO: 2 

Name: I337/NS2-3 ? /wt 
Composition (Nucleotide positions) : 
1.1-341: HCV 5' non-translated region 
5 2.342-1181: HCV Core Protein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
3.1190-1800: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 

downstream located HCV open reading frame 
10 4.1801-8403: HCV Polyprotein from nonstructural protein 2 

up to nonstructural protein 5B 

5. 1801-2451: nonstructural protein 2 (NS2); HCV NS2-3 
Protease 

6. 2452-4344: nonstructural protein 3 (NS3) ; HCV NS3 
15 Protease /He li case 

7. 4345-4506: nonstructural protein 4A (NS4A) ; NS3 
Protease cof actor 

8. 4507-5289: nonstructural protein 4B (NS4B) 

9. 5290-6630: nonstructural protein 5A (NS5A) 

20 10.6631-8403: nonstructural protein 5B (NS5B) ; RNA- 

dependent RNA-polymerase 
11.8407-8637: HCV 3' non-translated region 

SEQ ID-NO: 3 

25 Name: I389/NS3-3 1 /wt • - 

Composition (Nucleotide positions) : 
1.1-341: HCV 5 1 non-translated region 

2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
30 3 . 1202-1812 : internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 

downstream located HCV open reading frame 
4.1813-7767: HCV Polyprotein from nonstructural protein 3 

up to nonstructural protein 5B 
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5. 1813-3708: nonstructural protein 3 (NS3) ; HCV NS3 
Protease/Helicase 

6. 3709-3870: Nonstructural protein 4A (NS4A) ; NS3 
Protease Cofactor 

5 1. 3871-4653: Nonstructural protein 4B (NS4B) 

8. 4654-5994: Nonstructural protein 5A (NS5A) 

9. 5995-7767: Nonstructural protein 5B (NS5B) ; RNA- 
dependent RNA-Polymerase 

10. 7771-8001: HCV 3' non-translated Region 

10 

SEQ ID-NO: 4 

Name: I337/NS3-3 ' /wt 
Composition (Nucleotide positions) : 
1.1-341: HCV 5' non-translated region 
15 2.342-1181: HCV Core Protein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
3.1190-1800: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 

downstream located HCV open reading frame 
20 4.1801-7758: HCV Polyprotein from Nonstructural protein 3 

up to Nonstructural protein 5B 

5. 1801-3696: Nonstructural protein 3 (NS3) ; HCV NS3 
Protease/Helicase 

6. 3697-3858: Nonstructural protein 4A (NS4A) ; NS3 
25 Protease Cofactor «*- 

7. 3859-4641: Nonstructural protein 4B (NS4B) 

8. 4642-5982: Nonstructural protein 5A (NS5A) 

9. 5983-7755: Nonstructural protein 5B (NS5B) ; RNA- 
dependent RNA-Polymerase 

30 10. 7759-7 989: HCV 3' non-translated Region 



SEQ ID-NO: 5 

Name: I389/NS2-3 1 /wt 

Composition (Nucleotide positions) : 
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1.1-341: HCV 5 r non- translated region 

2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 
5 encephalomyokarditis virus; directs translation of the 

downstream located HCV open reading frame 
4.1813-8418: HCV Polyprotein from Nonstructural protein 2 

up to Nonstructural protein 5B 

5. 1813-2463: Nonstructural protein 2 (NS2) ; HCV NS2-3 
10 Protease 

6. 2464-4356: Nonstructural protein 3 (NS3) ; HCV NS3 
Protease/Helicase 

7. 4357-4518: Nonstructural protein 4A (NS4A) ; NS3 
Protease Cofactor 

15 8. 4519-5301: Nonstructural protein 4B (NS4B) 

9. 5302-6642: Nonstructural protein 5A (NS5A) 
10.6643-8415: Nonstructural protein 5B (NS5B) ; RNA- 

dependent RNA-Polymerase 
11. 8419-8649: HCV 3' non-translated Region 

20 

SEQ ID-NO: 6 

Name: I389/NS3-3 ■ /9-13F 
Composition (Nucleotide positions) : 
1.1-341: HCV 5' non-translated region 
25 2.342-1193: HCV Core Prot-ein-Neomycin Phosphotransferase 
fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
30 4.1813-77 67: HCV Polyprotein from Nonstructural protein 3 
up to Nonstructural protein 5B of the cell culture- 
adapted mutant 9-13F 
5. 1813-3708: Nonstructural protein 3 (NS3) ; HCV NS3 
Protease/Helicase 
35 6. 3709-3870: Nonstructural protein 4A (NS4A) ; NS3 
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Protease Cofactor . . 

7. 3871-4653: Nonstructural protein 4B 

8. 4654-5994: Nonstructural protein 5A 

9. 5995-7767: Nonstructural protein 5B 
5 dependent RNA-Polymerase 

7771-8001: HCV 3' non-translated Region 

SEQ ID-NO: 7 

Name : I389/Core-3 1 /9-13F 
10 Composition {Nucleotide positions) : 

1.1-341: HCV 5* non-translated region 
2.342-1193: HCV Core Prot ein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 
15 encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
4.1813-10842: HCV Polyprotein from Core up to Nonstructural 
protein 53 of the cell culture-adapted mutant 9-13F 
5. 1813-2385: HCV Core Protein; structural protein 
20 6. 2386-2961: envelope protein 1 (El); structural 

protein 

7. 2962-4050: envelope protein 2 (E2) ; structural 
protein 

8. 4051-4239: Protein p7 

25 9. 4240-4890: Nonstructural protein 2 (NS2) ; HCV NS2-3 

Protease 

10.4891-6783: Nonstructural protein 3 (NS3) ; HCV NS3 

Protease/Helicase 
11.6784-6945: Nonstructural protein 4A (NS4A) ; NS3 
30 Protease Cofactor 

12.6946-7728: Nonstructural protein 4B (NS4B) 
13.7729-9069: Nonstructural protein 5A (NS5A) 
14.9070-10842: Nonstructural protein 5B (NS5B) ; RNA- 
dependent RNA-Polymerase 
35 15.10846-11076: HCV 3 f non-translated Region 



(NS4B) 
(NS5A) 

(NS5B); RNA- 
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SEQ ID -NO: 8 

Name: I389/NS3-3 ' /5 . 1 
Composition (Nucleotide positions) : 
1.1-341: HCV 5 1 non-translated region 
5 2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 
fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
10 4.1813-7767: HCV Polyprotein from Nonstructural protein 3 
up to Nonstructural protein 5B of the cell culture- 
adapted mutant 5.1 
5. 1813-3708: Nonstructural protein 3 (NS3) ; HCV NS3 
Protease /Helicase 
15 6. 3709-3870: Nonstructural protein 4A (NS4A) ; NS3 

Protease Cofactor 

7. 3871-4653: Nonstructural protein 4B (NS4B) 

8. 4654-5994: Nonstructural protein 5A (NS5A) 

9. 5995-7767 : Nonstructural protein 5B (NS5B) ; RNA- 
20 dependent RNA-Polymerase 

7771-8001: HCV 3 ? non-translated Region 

SEQ ID -NO: 9 

Name: l389/Core-3 1 /5 . 1 
25 Composition {Nucleotide positions) : 

1.1-341: HCV 5' non-translated region 

2.342-1193: HCV Core Protein-Neomycin Phosphotransferase. 

fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 
30 encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
4.1813-10842: HCV Polyprotein from Core up to Nonstructural 
protein 5B of the cell culture-adapted mutant 5.1 
5. 1813-2385: HCV Core Protein; structural protein 
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6. 2386-2961 : envelope- protein 1 (El); structural 
protein 

7. 2962-4050: envelope protein 2 (E2); structural 
protein 

5 8. 4051-4239: Protein p7 

9. 4240-4890: Nonstructural protein 2 (NS2) ; HCV NS2-3 
Protease 

10,4891-6783: Nonstructural protein 3 (NS3) ; HCV NS3 
Protease/Helicase 
10 11.6784-6945: Nonstructural protein 4A (NS4A) ; NS3 

Protease Cof actor 
12 . 6946-7728 : Nonstructural protein 4B (NS4B) 
13.7729-9069: Nonstructural protein 5A (NS5A) 
14.9070-10842: Nonstructural protein 5B (NS5B) ; RNA- 
15 dependent RNA-Polymerase 

15.10846-11076: HCV 3' non-translated Region 

SEQ ID-NO: 10 

Name: I389/NS3-3 1 /19 
20 Composition {Nucleotide positions) : 

1.1-341: HCV 5* non-translated region 

2,342-1193: HCV Core Protein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 
25 encephalomyokarditis virus; directs translation of the 

downstream located HCV open reading frame 
4.1813-7767: HCV Polyprotein from Nonstructural protein 3 
up to Nonstructural protein 5B of the cell culture- 
adapted mutant 19 
30 5. 1813-3708: Nonstructural protein 3 (NS3) ; HCV NS3 

Protease/Helicase 

6. 3709-3870: Nonstructural protein 4A (NS4A) ; NS3 
Protease Cofactor 

7. 3871-4653: Nonstructural protein 4B (NS4B) 
35 8. 4654-5994: Nonstructural protein 5A (NS5A) 
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9. 5995-7767: Nonstructural protein 5B (NS5B) ; RNA- 
dependent RNA-Polymerase 
7771-8001: HCV 3 f non-translated Region 

5 SEQ ID-NO: 11 

Name: I389/Core-3 T /19 

Composition (Nucleotide positions) : 

1.1-341: HCV 5 T non-translated region 

2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 
10 fusion. protein; selectable Marker 

3.1202-1812: internal ribosome entry site from 

encephalomyokardit is virus; directs translation of the 
downstream located HCV open reading frame 
4.1813-10842: HCV Polyprotein from Core up to Nonstructural 
15 protein 5B of the cell culture-adpated mutant 19 

5. 1813-2385: HCV Core Protein; structural protein 

6. 2386-2961: envelope protein 1 (El); structural 
protein 

7. 2962-4050: envelope protein 2 (E2); structural 
20 protein 

8. 4051-4239: Protein p7 

9. 4240-4890: Nonstructural protein 2 (NS2) ; HCV NS2-3 
Protease 

10.4891-6783: Nonstructural protein 3 (NS3) ; HCV NS3 
25 Protease/Helicase - 

11.6784-6945: Nonstructural protein 4A (NS4A) ; NS3 

Protease Cofactor 
12.6946-7728: Nonstructural protein 4B (NS4B) 
13.7729-9069: Nonstructural protein 5A (NS5A) 
30 14.9070-10842: Nonstructural protein 5B (NS5B) ; RNA- 

dependent RNA-Polymerase 
15.10846-11076: HCV 3' non-translated Region 
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The invention is described in detail in the following by 
way of examples of embodiments and respective diagrams. 
These are as follows 

5 Fig. 1 A: The structure of a HCV-RNA construct according to 
the invention 

On the top a diagram of the structure of the 
complete parental HCV genome is given with the 
positions of the genes for the cleavage products 
core, El, E2, p7, NS2, NS3, NS4A, NS4B, NS5A, and 
NS5B within the polyprotein f and the 5 1 and 3' 
non-translated regions (5' NTR and 3' NTR) 
shown as a thick horizontal line and with the 
two positions selected for the creation of the 
subgenomic constructs, namely the position of the 
1 GDD catalytic domain 1 of the NS5B RNA polymerase 
(GDD) and the position of the 3' end of the HCV- 
IRES (nucleotide positions 1 to 377 and 1 to 389) 
- drawn above the diagram of the genome. The 
numbers below the diagram of the genome indicate 
the respective nucleotide positions. 

Diagrams of the structure of two modified HCV-RNA 
25 constructs (subgenome) according to the invention 

are shown, below, consisting of the 5 1 HCV-IRES, 
the neomycin phosphotransferase gene (Neo ), the 
EMCV-IRES (E-I) and the HCV sequences of NS2 or 
NS3 up to the authentic 3 1 end. The position of 
30 the 10-amino acid deletion comprising the NS5B 

polymerase GDD motive is marked with a triangle ( 
A) . 



15 



Fig. 1 B: The result of a denaturing f ormaldehyde-agarose 
35 gel electrophoresis for the detection of 

replicating plus-strand RNA in transfected 
subpassaged Huh 7 cell clones. 
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The positions of HCV specific RNAs (arrows) and 
the 28S rRNA are specified to the right of lane 
12 f the size (number of nucleotides) of the RNA 
marker (M) is specified to the left of lane 1. 

Pig. 1 C: The result of a PCR test with subsequent Southern 
Blot to demonstrate the absence of integrated 
replicon DNA in most of the selected cell clones. 



10 



The lanes 1 and 2 show the positive control, lane 
13 the . negative control. The figures to the left 
of lane 1 indicate the size of the nucleotide 
marker molecules. 

15 

Fig. 2 A: The result of a PCR test with subsequent Southern 
Blot for the detection of the sensitive exclusion 
of integrated replicon DNA (plasmid molecule 
I377/NS3-3 1 /wt) in a cell clone containing a HCV- 
20 RNA construct (9-13). 

The lanes 7 to 11 represent the result of a 
titration of DNA molecules of the construct 
I377/NS3-3 1 /wt without addition of total DNA of 

25 the cell clone 9-13, and the lanes 2-6 

represent . the result obtained with the same 
plasmid molecules with the addition of 1 pg 9-13 
DNA each prior to the PCR (for the purpose of 
excluding an inhibitor of the PCR in the DNA 

30 preparation) . Lane 13 represents the negative 

control (PCR without DNA template) . Lane 1 shows 
. the result, which was achieved with one pg total 
DNA of the cell clone 9-13. 



35 Fig. 2 B: The result of a Northern Blot test to quantify 
the amounts of HCV plus- and minus-strand RNA. 



CA 02303526 2000-03-31 

- 31 - 



The arrows mark" the positions of replicon RNA. 

The "plus" and "minus" details indicate the 
positive (plus) and negative (minus) polarity of 
5 the RNA controls, which have been applied to the 

gel. "Minus-strand" and "Plus-strand" indicate 
the specificity of the radioactive RNA probes. 

Fig. 2 C: The result of a f ormaldehyde-agarose gel 
10 electrophoresis after radioactive labeling of the 

intracellular replicated HCV-RNA to demonstrate 
the resistance of HCV-RNA replication to 
dactinomycin . 

15 Fig. 3 A: The detection of HCV specific antigens in the 
selected cell clones by means of 

immunoprecipitation after metabolic radioactive 
labeling. 

20 The lanes 7-9 represent authentic size marker 

(which have been obtained in Huh 7 cells after 
the transient expression of a HCV-RNA construct) ; 
identified HCV proteins are labeled on the left 
edge of lane 1, the molecular weights (in. 

25 Kilodalton) are specified to the right of lane 9. 

Fig. 3 B: Results of an " immunofluorescence test to 
establish the subcellular localisation of HCV 
antigens . 

30 

Fig. 4 : Diagram of the structure of a selectable HCV-RNA 
construct according to the invention (complete 
genome) consisting of the 5' HCV-IRES, the 
neomycin phosphotransferase gene (NeoR) , a 
35 heterologous IRES element, for example from the 

encephalomyocardit is virus (E-I) , the complete 
HCV open reading frame and the authentic 3 f NTR. 
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Fig* 5: Diagram of the structure of HCV-RNA constructs 
with inserted antibiotic resistance gene (A) 
within the nucleotide sequence encoding the 
5 polyprotein (monocistronic RNA construct), and 

(B) within the 3' NTR (bicistronic RNA 
construct ) . 

Fig. 6: Diagram of the structure of HCV-RNA constructs 

10 with inserted reporter genes (A) as part of a HCV 

replicon from NS3 to NS5B, - in the end the 
reporter protein is cleaved by viral or cellular 
proteases out of the polyprotein or the marker 
gene for selection (selection gene) or the 

15 resistance gene are transfected into the cells 

through co-transf ection -, (B) as part of a 
fusion gene composed of a resistance and reporter 
gene (for example for the neomycin 
phosphotransferase and green fluorescent 

20 protein), (C) as part of a replicon composed of a 

resistance and reporter gene (for example the 
neomycin phosphotransferase and green fluorescent 
protein) bound via a nucleotide sequence, which 
encodes for an amino acid sequence (area with 

25 hatches), which can be cleaved by a protease or 

has a self -cleaving (autocatalytic) activity, (D) 
as independent gene (in this case the green, 
fluorescent protein) , which is expressed from its 
own internal ribosome binding site (IRES); — the 

30. resistance gene (in this case the neomycin 

phosphotransferase gene) is also expressed from 
its own internal ribosome binding site (IRES) 
(polycistronic construct) . 



35 Fig. 7: 



Diagram of the structure of a HCV-RNA construct, 
where the resistance gene is linked to the HCV- 
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RNA sequence via a ribozyme or a recognition site 
for a ribozyme. 

The thick lines illustrate the HCV 5' and 3* 
NTRs , E-I is a heterologous internal ribosome 
binding site, which is required for the 
expression of the resistance gene, and the grey' 
square illustrates the ribozyme or a recognition 
site for a ribozyme. 

Diagram of the structure of a HCV-RNA construct 
with resistance gene and integrated foreign gene. 

Method for comparing the specific infectivity 
(expressed as number of cell colonies obtained) 
of total RNA against in vitro transcripts. HCV- 
RNA is generated by in vitro transcription of a 
corresponding RNA construct and quantified by 
measurement of the optical density at 260 nm (OD 
26G nm) . A defined number of these molecules is 
mixed with a specified amount of total RNA from 
naive Huh-7 cells and this mixture is transfected 
into naive Huh-7 cells with the aid of 
elect roporation. At the same time the total RNA 
of a cell clone, produced by the method described 
in Figure 1, is isolated using a known state of 
the art method and the amount of HCV-RNA 
contained therein is determined by means of 
Northern Blot using a HCV specific RNA probe and 
subsequent quantification via phosphoimaging. A 
defined amount of this total RNA is analogously 
transfected in naive Huh-7 cells. These cells 
from both the cultures are then subjected to a 
G418 selection and the number of colonies created 
is determined by counting after fixing and 
staining with Coomassie Brilliant Blue. For the 
determination of transfection efficiency lpg of a 



CA 02303526 2000-03 

- 34 - 



-31 



plasmici allowing the expression of luciferase is 
added to each transfection culture. An aliquot of 
the transfected cells is collected after 24 hours 
and the luciferase activity determined in the 
5 respective cell lysates. The number of colonies 

is always normed to the luciferase expression. 

Fig. 10: Sequence analysis of the 9-13 clone. Total RNA of 
the 9-13 cell clone, resulting from transfection 

10 of HCV-RNA construct I377/NS3-3 1 , was isolated 

using a known state of the art method and the 
HCV-RNA construct amplified from nucleotide 
position 59 to 9386 with the aid of 'long- 
distance RT-PCR 1 using primer S59 and A9413. The 

15 PCR fragments were cloned and 11 clones (9-13 A - 

K) completely sequenced. Clones D and I, E and G 
as well as H and J turned out to be identical, 
respectively. The positions of the amino acid 
differences in the NS3-5B region between the 

20 recloned HCV-RNAs and the parental construct are 

marked with a thick vertical mark for each clone. 
Each clone was digested with restriction enzyme 
Sfll and the respective fragment inserted in the 
parental construct. These clones were each 

25 transfected in Huh-7 cells and the cells 

subjected to selection as described in Figure 1. 
The number of" cell clones obtained with each 
construct is noted next to the respective 
construct on the right. 

30 

Fig. 11 A: Principle of determination of replication with 
the aid of a reporter gene. In the upper part of 
the figure, the HCV-DNA construct I 38 9/Luc/NS3-3 1 
is shown. It consists of the HCV 5' NTR 
35 (nucleotide positions 1-389), the luciferase gene 

(2uc), the IRES of the encephalomyocarditis 
virus, the HCV NS3-5B and the 3' NTR. The 
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position of the active centre of the NS5B RNA 
polymerase, into which a deactivating amino acid 
exchange was introduced, is indicated by f GND 1 . 
The plasmids, which code for the HCV-RNA 
5 construct which is able to replicate or is 

defective, are digested with the restriction 
enzyme Seal and added to an in vitro 
transcription reaction with T7 RNA polymerase. 
After removal of the template DNA, the respective 
10 HCV-RNA constructs were transfected in naive Huh- 

7 cells by means of electroporation and the 
latter collected at regular intervals. 

Fig. 11 B: Comparison of luciferase activity in cells 
15 transfected with the parental HCV-RNA construct 

I 38 s/Luc/NS3-3 ' /wt (wt) or the following variants: 
inactive RNA (318 DN) , variants 9-13F or variant 
5.1. The cells were collected 6 (not shown), 24, 
48, 72, 96, 120, 144 and 168 hours after 
20 transfection and luciferase activities determined 

by luminometric measurement. 

Fig. 12: Selectable HCV full-length genomes (constructs 
I 389 /core 3V5.1 and I 389 /core 3V9-13F). 
25 (A) Diagram of the full-length construct. The 

region between both indicated recognition sites 
for the restriction enzyme Sfil corresponds to 
the sequences of the highly adapted RNA variants 
5.1. or 9-13F. 

30 (B) Number of colonies which were obtained after 

transfection of 0.1 pg in vitro transcribed RNA 
of the construct I 3B 9/core-3 V5 . 1 . described under 
A into HUH7-cells. The result of a representative 
experiment is given. 

35 (C) Demonstration of autonomously replicating HCV 

full-length RNAs in G418 resistant cell clones 
which were obtained after transfection of the 
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corresponding in vitro transcript. The 
illustration shows the autoradiogram of a 
Northern Blot, which was hybridised with a probe 
against the neo-resistance gene and the HCV 5' 
5 NTR . The controls shown in lanes 1 and 2 each 

correspond to 10 8 molecules of the indicated in 
vitro transcripts, mixed with total RNA from 
naive Huh-7 cells. The negative control contains 
only total RNA from naive Huh-7 cells (lane 3) . 

10 Lanes 4-9 contain 3-10 pg total RNA from G418 

resistant cell clones, which were obtained after 
transfection by in vitro transcribed I 38 $/core 
3V5.1 RNA or I 3B s/core 3V9-13F RNA. The G418 
concentration used for the selection is indicated 

15 in each case. Five of the cell clones shown 

contain the highly adapted RNA variant 5.1 (lanes 
4-8), one contains the adapted RNA variant 9-13F 
(lane 9) . 

20 Fig. 13: HCV- RNA constructs with a reporter gene. (A) 
Bicistronic HCV-RNA constructs. The reporter gene 
is translated with the aid of a separate IRES. 
(B) Monocistronic HCV-RNA constructs. The 
reporter gene product is expressed as fusion 

25 protein with a HCV protein- Both portions are 

linked via a recognition sequence for a viral or 
cellular protease, which permits a proteolytic 
separation of the two fused protein portions. In 
the example shown the reporter gene product and 

30 respective HCV protein was fused through a 

recognition sequence for ubiquitin (Ub) . 

Fig. 14: Tricistronic full-length HCV-RNA construct, that 
in addition to the resistance gene possesses an 
35 inserted foreign gene (ther. gene) . 
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Fig- 15: Monocistronic HCV-RNA constructs, for which the 
resistance gene product is expressed as a fusion 
protein with HCV portion. The resistance gene 
(RG) is either active as a fusion protein or it 

5 is fused with the HCV portion via a 

proteolytically cleavable sequence in such a way 
that the resistance gene product is split from 
the HCV portion by a cellular or viral protease. 
In the example shown the resistance gene was 

10 fused with the respective HCV portion through the 

sequence coding for ubiquitin (Ub) . 



15 Example 1 : Production of HCV-RNA constructs 

(A) Synthesis and cloning of a complete HCV consensus 
genome by means of RT-PCR 

20 The HCV genome, which means the HCV-RNA, was isolated from 
the liver of a chronically infected patient as described in 
the following: 

The total RNA was isolated from approx. 100 mg of liver 
25 according to the method described by Chomczynski and Sacci 
(1987, Anal. Biochem. 162, 156). Using 1 pg of this 
isolated RNA a reverse transcription was carried out with 
the primers A6103 (GCTATCAGCCGGTTCATCCACTGC) or A9413 
(CAGGATGGCCTATTGG CCTGGAG) and the expand reverse 
30 transcriptase system (Boehringer Mannheim, Germany) 
according to the manufacturer's recommendations. A 
polymerase chain reaction (PCR) was carried out with the 
products of this reverse transcription using the expand 
long template system (Boehringer Mannheim, Germany), in 
35 which the buffer containing 2 % of dimethyl sulfoxide was 
used. After one hour at 42 °C, 1/8 of the reaction mixture 
was used for a first PCR with primers A6103 and S59 
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{ TGTCTTCACGCAGAAAGCGTCTAG ) or A9413 and S4 542 (GATGAGCT 
CGCCGCGAAGCTGTCC) . After 40 cycles, 1/10 of this reaction 
mixture was used for a second PCR with primers S59 and 
A4919 (AGCACAGCCCGCGTCATAGCACTCG) or S4542 and A938 6 
5 (TTAGCTCCCCG TTCATCGGTTGG . After 30 cycles the PCR products 
were purified by means of preparative agarose gel 
electrophoresis and the eluted fragments were ligated into 
the vector pCR2 . 1 (Invitrogen) or pBSK II ( Stratagene) . 
Four clones of each fragment were analyzed and sequenced, 

10 and a consensus sequence was established- For this purpose 
the DNA sequences were compared to each other. The 
positions,, where the sequence of one fragment was different 
from the others, were considered as undesired mutations. In 
the case of ambiguities of the sequence, shorter 

15 overlapping PCR fragments of the respective region were 
amplified and several clones sequenced. By this means 
several potential mutations could be identified in each 
fragment and consequently an isolate specific consensus 
sequence could be established. This established consensus 

20 sequence or genome belongs to the worldwide spread genotype 
lb. The non-translated region at the 3 f end (=3' NTR) was 
obtained by means of a conventional PCR, whereby an 
antisense primer, which covers the last 24 nucleotides of 
the 'X-tail' known from the state of the art was used 

25 {Tanaka et al . , 1995, Biochem. Biophys. Res. Commun. 215, 
744; und Rice, PCT/US 96/14033). The authentic non- 
translated region on the*5' end (=5' NTR) downstream of . the 
T7 promoter was created by means of PCR, whereby an 
oligonucleotide, which corresponds to a shortened T7 

30 promoter (TAA TAC GAC TCA CTA TAG) and the first 88 
nucleotides of HCV, was used on one hand and one of the 
previously mentioned plasmids, which carries one of the 5 1 
fragments of the genome, was used on the other hand. A 
complete HCV consensus genome was assembled from the 

35 subgenomic fragments with the smallest number of non- 
consensus replacements and inserted into a modified pBR322 
vector. Deviations from the consensus sequence were 
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eliminated by means of site-directed "mutagenesis . In order 

to produce run-off transcripts with an authentic 3' end, 

the 3' NTR of the isolates (with the end TGT) was modified 

to AGT (according to. the sequence of the genotype 3 = clone 

5 f WS' according to Kolykhalov et al. f 1996, J. Virol. 70, 

3363) and an additional nucleotide replacement was carried 

# 

out at position 9562, to preserve the A: T pairing in the 
hairpin structure at the 3 1 end of the 3* NTR (Kolyhalov et 
al. ibid.). In order to eliminate an internal restriction 
10 site for the Seal enzyme, a silent nucleotide replacement 
was further carried out. After joining the full-length 
genome with the appropriate 5 1 and 3 9 NTRs the complete HCV 
sequence was analyzed. No undesired nucleotide replacement 
was detected. 

15 

The HCV genome produced in this way should be hepatotropic 
according to the definition. 

(B) Synthesis of selectable HCV subqenomic constructs 

20 

By means of the consensus genome described under (A) , HCV 
subgenomic constructs were created, which include the 
antibiotic resistance gene neomycin phosphotransferase 
(NPT) and two sequences of internal ribosome entry sites 

25 (IRES) . The biochemical procedures used for this are known 
and familiar to the expert (see: Sambrook, J., E.F. 
Fritsch, T. Maniatis, 1989, Molecularcloning : a laboratory 
manual, 2nd ed., Cold Spring Harbour Laboratory, Cold 
Spring Harbor, N.Y.; Ausubel et al. (eds.), 1994, Current 

30 Protocols in Molecular Biology, Vol. 1-3, John Wiley & Sons 
Inc., New York). The antibiotic resistance gene was 
inserted immediately after the 5' NTR, obtaining a 
bicistronic RNA as a result (see Fig. 1 A) . However, the 
antibiotic resistance gene might as well be inserted at 

35 another site of the HCV subgenomic construct, for example 
within the nucleotide sequence encoding for the 
polyprotein, obtaining a monocistronic RNA as a result (see 
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Fig. 5 A) r or in the 3' NTR (see Fig. 5 B) . The IRES 
elements are one of the two HCV-IRES variants from 
nucleotides 1-377 or nucleotides 1-389 as well as the IRES 
of the encephalomyocarditis virus, which controls the 
5 translation of the HCV sequence downstream from the genes 
for NS2 or NS3 up to the authentic 3 f end of the genome. 

* 

The two HCV-IRES variants mentioned were determined in the 
following way: 

10 

Based on deletion analyses of the 3' end of the HCV-IRES 
(Reynolds et al.1995, EMBO J. 14, 6010) various segments of 
the 5' NTR were fused with the NPT gene and analyzed with 
regard to the maximum number of colonies formed by means of 

15 co-transfection with a plasmid containing the T7 RNA 
polymerase gene. The best results were achieved with the 
HCV sequences from 1-377 to 1-389. Due to the fact that the 
AUG start codon of the HCV polyprotein is located at 
position 342 and is. consequently included in the IRES 

20 sequence^ 12 or 16 amino acids of the HCV core protein 
("core protein") were fused with the neomycin 
phosphotransferase (see Fig. 1 A) . 

Accordingly, these modified HCV subgenomic constructs were 
25 given the designations I 3 77/NS2-3' (or I 37 7/NS3-3') and 
I3B9/NS2-3 1 (or I389./NS3-3' ) . They are illustrated in a 
diagram in Fig. 1A. 

Different cell lines and primary cell cultures of human 
30 hepatocytes were transfected with in vitro transcripts of 
these modified parental HCV subgenomic constructs I 377 /NS2- 
3' (or I 3 77/NS3-3') and I389/NS2-3 , (or I 38 9/NS3-3 • ) . 

As a parallel negative control for all transfection 
35 experiments, an appropriately modified, but defective 
subgenome was constructed for every modified parental HCV 
subgenomic construct, which differs from the parental 
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construct due to the fact, that it has a deletion of 10 
amino acids within the reading frame comprising the active 
site of the NS5B RNA polymerase (Behrens et al., 1996, EMBO 
J. 15, 12; and Lohmann et al., 1997, J". Virol. 71, 8416). 

5 

(C) Synthesis of selectable HCV genomic constructs 

A NS2-3' subgenomic construct, linked at its 5' end to a 
fragment of the luciferase gene and the complete EMCV-IRES, 

10 was restricted with Ncol and Spel and purified using 
preparative agarose gel electrophoresis. The vector 
obtained in this way was ligated with a Ncol/Not I-HCV 
fragment corresponding to the nucleotide positions 342 to 
1968 of the HCV genome and a Notl/Spel fragment 

15 corresponding to the nucleotide positions 1968-9605 in a 3- 
factor-ligation. The resulting construct, where the 
complete HCV open reading frame and the 3 ! NTR lie 
downstream from the luciferase gene fragment and the EMCV- 
IRES, was then restricted with Pmel and Spel and ligated 

20 with the analogously restricted I 399 /NS3-3 ' /wt subgenomic 
construct vector. This selectable HCV genomic construct is 
illustrated in Fig. 4. 

(D) Production of in vitro transcripts corresponding to the 
25 HCV-RNA constructs 

The previously described purified plasmid DNAs were 
linearized with Seal and used for an in vitro transcription 
reaction after phenol/chloroform extraction and isopropanol 

30 precipitation using the following components: 80 mM HEPES, 
pH 7.5, 12,5 mM MgCl 2 , 2 mM Spermidine, 40 mM 
Dithiothreitol, 2 mM of each NTP, 1 unit RNasin/pl, 50 
Vig/ml restricted DNA and approx. 2 units/pl T7 RNA 
polymerase. After 2 hrs . at 37 °C half of the amount of T7 

35 polymerase was added and the reaction mixture was incubated 
for two further hours. In order to remove DNA the mixture 
was extracted with acid phenole (U. Kedzierski, J.C. Porte, 
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1991, Bio Techniques 10, 210), precipitated with 
isopropanol, the pellet was dissolved in water and 
incubated with DNase (2 units per pg DNA) for 60 min. at 37 
°C- After subsequent extraction with acid phenole, acid 
5 phenole/chlorof orm and chloroform as well as isopropanol 
precipitation the dissolved RNA was quantified using 
optical density measurement and its integrity was checked 
using f ormaldehyde-agarose gel electrophoresis. 

10 

Example 2 : Transfection experiments with the hepatoma cell 
line Huh 7 

With all transfection experiments it was carefully ensured 
15 that any template DNA had been removed beforehand so as to 
avoid the possibility of this DNA integrating in 
transfected cells and conferring a neomycin resistance to 
them independent from HCV replication. The reaction mixture 
was therefore treated with 2 units of DNase per pg DNA for 
20 60 min. at 37 °C after the in vitro transcription and 
extracted with acid phenole, acid phenole/chlorof orm and 
chloroform. Prior to being used for the transfection the 
precipitated RNA was analyzed using f ormaldehyde-agarose 
gel electrophoresis. 

25 

Three separate transfection experiments were carried out 
with the highly differentiated human hepatoma cell line Huh 
7 (according to Nakabayashi et al . 1982, Cancer Res. 42, 
3858) . Each time 15 ]ig RNA were introduced into 8 x 10 6 Huh 

30 7 cells by electroporation and the cells were seeded in 
culture dishes with a diameter of 10 cm. 24 hours after 
seeding, neomycin (= G418) was added in a final 
concentration of 1 mg/ml. The culture medium was changed 
twice a week. After 3-5 weeks small colonies were 

35 visible, which were isolated and grown under the same 
culture conditions. 
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The cell clones obtained during the first experiment were 
isolated and subpassaged. Most of the clones died during 
this procedure, and the final yield was only 9 clones from 
cells, which had been transfected with the parental HCV 
5 subgenomic constructs and 1 clone (clone 8-1) from cells, 
which had been transfected with a defective HCV genomic 
construct, namely a defective NS2-3' HCV-RNA. Apart from an 
extended doubling time and the occasional occurrence of 
irregularly shaped cells, no consistent morphological 
10 differences were found between these 9 cell clones and the 
single cell clone (clone 8-1) or the parental Huh 7 cells. 

The. main criteria for functioning HCV genomic constructs 
are the formation of viral RNA with the correct size and 
15 the absence of (integrated) plasmid DNA, which could 
transfer or mediate on a G418 resistance. 

To determine the HCV- RNA in the Huh 7 cells/ the total RNA 
was isolated and analyzed by means of the common Northern 

20 Blot method using a plus-strand specific ribo probe (RNA 
probe) . For this purpose the total RNA was isolated from 
the respective cell clones according to the method 
described by Chomczynski and Sacchi 1987, Anal. Biochem. 
162, 156 and 10 ug RNA, which is equivalent to the total 

25 RNA content of 0,5 -1 x 10 5 cells, are separated using 
denatured f ormaldehyde-agarose gel electrophoresis (lanes 3 
to 12 of Fig. IB). At the same time 10 9 in vitro 
transcripts (ivtr.), which correspond to the I389/NS2-3 ' /wt 
or the I 38 9/NS3-3' /wt replicon RNAs , are separated as well 

30 as size markers with authentic sequence (lane 1 or lane 2) . 
The separated RNA was transferred onto nylon-based 
membranes and hybridized with a radioactively labeled plus- 
strand specific RNA probe, which was complementary to the 
complete NPT gene and the HCV-IRES of nucleotide 377 to 

35 nucleotide 1. The positions of the HCV specific RNAs 
(arrows) and the 28S rRNA are specified to the right of 
lane 12, the size (amount of nucleotides) of the RNA marker 
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is specified to left of lane 1. The RNA marker fragments 
contain HCV sequences and therefore hybridize with the ribo 
probe (= RNA probe) . The results of this analysis are 
illustrated in Fig. 1 B. 

5 

With the exception of clone 8-1 transfected with the 
defective HCV genomic construct, all cell clones produced 
homogenous HCV-RNAs of correct lengths (approx. 8640 
nucleotides in the case of the NS2-3' and approx. 7970 

10 nucleotides in the case of NS3-3' replicon) . This result is 
an indication of the fact, that the functional replicons or 
the functional HCV genomic, constructs transfer the G418 
resistance. In order to exclude, that the G418 resistance 
is caused by a plasmid DNA, which is integrated in the 

15 genome of the Huh 7 host cell and transcribed under control 
of a cellular promoter, the DNA of each clone was analyzed 
by means of a NPT gene specific PCR. Consequently the DNA 
was isolated from the selected Huh 7 cell clones by means 
of digestion with proteinase K (40ug/ml, Ih, 37 °C) in 

20 lOmMTris, pH7,5, ImM EDTA, 0,5 % SDS and subsequent 
extraction with phenol, phenol/chloroform and isopropanol 
precipitation. The DNA precipitate was dissolved in 10 mM 
Tris (pH 7,5) and 1 mM EDTA and incubated with Rnase A for 
1 hour. Following a phenol/chloroform extraction and 

25 ethanol precipitation, 1 ug DNA, equivalent to 4 - 8 x 10 4 
cells, was analyzed by means of PCR using NPT gene specific 
primers (5 1 -TCAAGACCGACCTG TCCGGTGCCC-3 « and 5 f - 
CTTGAGCCTGGCGAACAGTTCGGC-3 1 ) , and a DNA fragment consisting 
of 379 nucleotides was generated. The specificity of the 

30 PCR product was established by means of the Southern Blot 
method, in which a DNA fragment labeled by digoxigenin was 
used, which corresponds to the NPT gene. As positive 
controls (for the detection of possibly contaminating 
nucleic acids) the PCR method was carried out with 10 7 

35 plasmid molecules or 1 ug DNA from a BHK cell line, which 
was stably transfected with a neomycin resistance gene, and 
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as negative control, the PCR was carried out with the same 
reagents but without added DNA. 

The results of this analysis are illustrated in Fig. 1 C. 
5 The lanes 1 and 2 represent the positive controls, lane 13 
represents the negative control. The numbers to the left of 
lane 1 indicate the sizes of the nucleotide marker 
molecules . 

10 A NPT-DNA could not be detected in any cell clone, apart 
from clone 7-3 (Fig.lC, lane 3) r which was obtained from 
cells after transfection with a NS2-3' replicon/NS2-3 ? HCV 
genomic construct, and clone 8-1 (Fig.lC, lane 12), which 
was obtained from cells after transfection with a defective 

15 HCV genomic construct. This result was another indication 
of the fact, that the G418 resistance of most clones was 
passed on by the replicating HCV-RNA. But even regardless 
of these results, it is unlikely, that integrated plasmid 
DNA produces HCV-RNAs of correct sizes, because the 

20 plasmids used for the in vitro transcription contain 
neither an eukaryotic promoter nor a polyadenylat ion 
signal. In the case of clone 7-3 the resistance is 
therefore very probably passed on by the HCV-RNA construct 
or the replicating HCV-RNA as well as by an integrated NPT 

25 DNA sequence, whereas the resistance of the cells of clone 
8-1 is only caused by the integrated plasmid DNA. 

Clone 9-13 (Fig. 1 B, lane 11) was subject to further 
tests, to confirm that the G418 resistance was passed on by 

30 a self-replicating HCV-RNA. Clone 8-1, which carries the 
integrated copies of the NPT gene, was used throughout as 
negative control. A PCR was carried out, which allows the 
detection of < 1000 NPT gene copies in ~ 40.000 cells, with 
the aim to rigorously exclude the presence of NPT-DNA in 

35 clone 9-13. The result of this PCR is illustrated in Fig. 
2A. The PCR proceeded in detail as follows: 
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During the test, 10 s - 10 2 plasmid molecules (I37-7/NS3- 
3'/wt) were used either directly (lanes 7 - 11) or after 
adding each 1 pg 9-13 DNA (lanes 2 - 6) . The specificity of 
the amplified DNA fragment was determined by Southern Blot 
5 using a NPT specific probe. A PCR without DNA probe was 
carried out as negative control (lane 12) . 

Even with this sensitive method, no plasmid DNA could be 
detected in one pg DNA of the cell clone 9-13 (lane 1). To 

10 estimate the amount of HCV plus- and minus-strand RNAs in 
these cells, a dilution series of total RNA was analyzed 
with the Northern Blot method using a plus- or minus-strand 
specific radioactively labeled ribo. probe (= RNA probe) . 
For this purpose, 8, 4 or 2 pg of total RNA, which have 

15 been isolated from the cell clones 9-13 and 8-1, were 
analyzed in parallel to known amounts of in vitro 
transcripts with plus- or minus-strand polarity (control 
RNAs) in the Northern Blot method and were then subjected 
to a hybridization. The hybridization was carried out with 

20 a plus-strand specific ribo probe, which covered the 
complete NPT gene and the HCV-IRES ('plus-strand', top 
panel), or with a minus-strand specific RNA probe, which 
was complementary to the NS3 sequence ('minus-strand', 
bottom panel) . The arrows mark the positions of replicon 

25 RNA. The results of this analysis are illustrated in Fig. 
2 B. 

Approx. 10 8 copies/pg total RNA were detected in the case 
of the plus-strand, which is equivalent to 1000 - 5000 HCV- 
30 RNA molecules per cell, whereas the amount of minus-strand 
RNA was 5- to 10-times less. This result corresponds to the 
assumption, that the minus-strand RNA is the replicative 
intermediate form or intermediate copy, which is used as a 
template for the synthesis of the plus-strand molecules. 

35 

Due to the fact that the reaction is mainly catalyzed by 
the viral RNA-dependent RNA polymerase, the synthesis of 
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the HCV-RNAs should be resistant to dactinomycin, an 
antibiotic, which selectively inhibits the RNA synthesis of 
DNA templates, but not the RNA synthesis from RNA 
templates. To confirm this assumption, cells were incubated 
5 with [ 3 H] uridine in the presence of dactinomycin, the 
radioactively labeled RNAs were extracted, separated by 
means of denaturing agarose gel electrophoresis and 
analyzed with the aid of a common Bio-Imager using a [ 3 H]- 
sensitive screen. For this purpose approx. 5 x 10 5 cells of 

10 the clones 9-13 and 8-1 were incubated at a time with 100 p 
Ci [ 3 H] uridine for 16 hrs. in the absence (-) or presence 
( + ) of 4 pg/ml of dactinomycin (Dact) . Following this 
labeling reaction the total RNA was prepared and analyzed 
by means of f ormaldehyde-agarose gel electrophoresis. Only 

15 1/10 of the total RNA is illustrated in the first two 
lanes. The radioactively labeled RNA was visualized using a 
BAS-2500 Bio-Imager (Fuji) . 

The results of this analysis are illustrated in Fig. 2 C. 

20 In accordance with the inhibitor profile of the NS5B 
polymerase (Behrens et al., 1996, EMBOJ. 15, 12 and Lohmann 
et al., 1997, J. Virol. 71, 8416), dactinomycin had no 
influence on the replication of the HCV RNA, whereas the 
synthesis of cellular RNA was inhibited. A RT-PCR was 

25 carried out for recloning the replicating sequences, to 
confirm the identity of the viral RNA. The sequence 
analysis of the recloned RNA showed that the RNA in clone 
9-13 is HCV specific and corresponds to the transfected 
transcript of the HCV construct I377 

/NS3-3'/wt. 

30 

For the analysis of the viral proteins, at first the 
respective cells were metabolically radioactively labeled 
with [ 35 S] methionine/cysteine, then lysed and afterwards 
the HCV specific proteins were isolated from the cell 
35 lysates by means of immunoprecipitation . The results of 
these analyses are illustrated in Fig. 3 A. The detailed 
procedure was as follows: Cells of the cell clones 9-13 
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(wt) and 8-1 (A) were metabolically radioactively labeled 
by treating them with a protein labeling mixture familiar 
to the expert and available on the market (for example. NEN 
Life Science) . The HCV specific proteins were separated 
5 from the cell lysate by immunoprecipitation (IP) under non- 
denaturing conditions (for example according to 
Bartenschlager et al., 1995, J. Virol. 69, 7519) using 
three different antisera (3/4, 5A, 5B, according to the 
labeling on the top end of the lanes 1 to 12) . The immune 

10 complexes were analyzed by means of tricine SDS-PAGE and 
made visible by means of autoradiography. To obtain 
authentic size markers, the homologous replicon construct 
I 377 /NS3-3 7wt was subject to a transient expression by the 
vaccinia virus-T7 hybrid system in the Huh 7 cells. The 

15 resulting products were used as size markers (lanes 7-9) 
parallel to the cells of the clones 9-13 and 8-1. 
Identified HCV proteins are labeled on the left edge of 
lane 1, the molecular weights (in Kilodalton) are specified 
on the right edge of lane 9. It should be noted that the 

20 NS3/4 specific antiserum ('3/4') used preferably reacts 
with NS4A and NS4B causing an underrepresentation of NS3. 

All viral antigens could unambiguously be detected, and 
their apparent molecular weights did not show any 

25 difference to those being detected after transient 
expression of the same bicistronic HCV-RNA construct in the 
original Huh 7 cells. An immunofluorescence detection 
reaction was carried out using NS3 and NS5A specific 
antisera, to determine the subcellular distribution of the 

30 viral antigens (for example according to Bartenschlager et 
al., 1995, J". Virol. 69, 7519). For this purpose cells of 
the clones 9-13 (wt) and 8-1 (A) were fixed with 
methanol/acetone 24 hrs. after incubating on coverslips and 
incubated with polyclonal NS3 or NS5A specific antisera. 

35 The bound antibodies were made visible with a commercially 
available FITC conjugated anti-rabbit antiserum. The cells 
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were counters tained with the 'Evans Blue* stain to suppress 
unspecific fluorescence signals. 

The results of this detection test are illustrated in Fig. 
5 3 B. A strong fluorescence in the cytoplasm could be 
detected with both antisera. The NS5A specific antiserum 
also caused a slight nuclear fluorescence, which indicates 
that at least small amounts of this antigen also reach the 
nucleus. But the generally dominating presence of the viral 
10 antigens in the cytoplasm are a strong indication that HCV- 
RNA replication occurs in the cytoplasm, as is the case 
with most RNA viruses. 

These results prove clearly that the establishment of a 
15 cell culture system for the HCV could be accomplished - with 
the test arrangement described, the efficiency of which 
surpasses everything known up until now by far and for the 
first time allows the detection of viral nucleic acids and 
proteins with conventional and approved biochemical 
20 methods. This efficiency actually. allows detailed 
examination of HCV pathogenesis, genetic analyses of 
different HCV functions and a precise study of the 
virus/host cell interaction, through which new starting 
points for the development of a antiviral therapy can be 
25 defined. 

Example 3 : Transfection of Huh 7 cells with HCV total 
constructs 

30 

Huh7 cells are transfected and selected as described in 
example 2, whereby in this case selectable constructs are 
used, which contain the complete virus genome. 
Corresponding to Example 2, the resulting cell clones are 
35 tested for the absence of HCV-DNA by means of PCR and the 
productive replication of HCV- RNA is then established by 
means of Northern Blot, [ 3 H] uridine labeling in the 
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presence of dactinomycin, detection of the viral proteins 
or antigens preferably with the aid of the Western Blot, 
the immunoprecipitation or immunofluorescence. In contrast 
to the arrangements described in Example 2, the construct 
5 described here makes it possible to obtain more complete 
and very likely infectious viruses, which has not been the 
case in the subgenomic constructs described in Example 2. 
These viruses existing in the cell and the cell culture 
supernatant are concentrated for example by means of 

10 ultracentrif ugation, immunoprecipitation or precipitation 
with polyethyleneglycol, and all exogenous nucleic acids, 
which means those that are not incorporated into, the virus 
particle, are digested by incubation with nucleases (RNase, 
DNase, micrococal nuclease) . In this way all contaminating 

15 nucleic acids, which are not included in the protecting 
virus particle, can be removed. After inactivation of the 
nucleases, the protected viral RNA is isolated for example 
by means of incubation with proteinase K in a buffer 
containing SDS, by extracting with phenol and 

20 phenol/chloroform and detected by means of Northern Blot or 
RT-PCR using HCV specific primers. Also in this test 
arrangement, the combination of the HCV consensus genome 
described with a selection marker was crucial for the 
efficient production of viral RNA, viral protein and 

25 therefore HCV particles. 

Example 4 : Production and application of a HCV- 
RNA construct, whereby the resistance gene is 
30 linked to the HCV subgenomic sequence via a 

ribozyme or a recognition site for a ribozyme. 

A HCV- RNA construct is produced according to Example 1 or 
Example 3, where an antibiotic resistance gene is linked to 
35 the HCV- RNA sequence through a ribozyme or a recognition 
sequence for a ribozyme. These constructs are illustrated 
in a diagram in Fig- 7. Huh 7 cells are transfected with 
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this HCV-RNA construct as described in Example 2. A 
selection with the appropriate antibiotic follows the 
transfection into the cells. The inserted ribozyme is 
activated in the cell clones obtained in the procedure or, 
5 in the case of a construct, which carries a recognition 
sequence for a ribozyme, the ribozyme is trans fected into 
the cell (for example by means of transfection of a 
ribozyme construct or infection with a viral expression 
vector, into which the respective ribozyme has been 

10 inserted). In both cases the resistance gene is separated 
from the HCV-RNA sequence by the ribozyme-dependent 
cleavage. The result in the case of the HCV genome is an 
authentic HCV genome without a resistance gene, which can 
form authentic infectious virus particles. A HCV replicon 

15 without resistance gene is created in the case of the HCV 
subgenomic constructs. 

Example 5 ; Co- transfection of a HCV-RNA construct with a 
20 separata lucif erase transfection construct 

A HCV-RNA construct is produced according to Example 1 (A) 
or Example 3 or Example 4. At the same time a transfection 
construct is produced, which comprises the luciferase gene, 

25 whereby this luciferase gene is linked to a first 
nucleotide sequence, which encodes a HCV protease (for 
example NS3 protease) cleavage site, to a second nucleotide 
sequence, which encodes for another protein or a part of 
another protein. HCV-RNA construct and transfection 

30 construct are transfected into any host cells, preferably 
hepatoma cells, most preferably Huh 7 cells. This can be 
realised as described in Example 2. The product of the 
modified luciferase gene is a luciferase fusion protein, 
where the luciferase is inactivated due to the fusion with 

35 the foreign part. The fusion protein, which contains a 
recognition sequence for a HCV protease, is cleaved in 
transfected cells with high HCV replication, and 
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consequently the active form of the luciferase, which can 
be identified through luminometric measurement, is 
released. If the replication of the HCV-RNA construct is 
inhibited the fusion protein will not be cleaved and no 
5 active luciferase will be released. The quantitative 
determination of the luciferase is therefore a measure for 
the replication of the HCV construct. Instead of the 
luciferase gene, another reporter gene can just as easily 
be used, which is modified in the same way, so that its 

10 expression depends on the viral replication, although this 
reporter gene is not part of the HCV construct. A cellular 
protein, which is deactivated or activated by the HCV 
proteins or nucleic acid, can also be used as a so called 
surrogate marker. The expression or activity of this 

15 surrogate marker is in this case a measure for the 
replication of the viral DNA. 

Example 6 ; Production of HCV subgenomic constructs with 
20 integrated foreign genes to be used as liver 

cell specific vector in gene therapy 

These recombinant and selectable HCV subgenomic constructs 
are transfected in transcomplementing helper cell lines, 

25 which means in cell lines, which express the missing 
functions (for example the structural proteins) in an 
inducible or constitutive way. Cell clones containing a 
functional HCV subgenomic construct can be established 
through appropriate selection. The viral structural 

30 proteins expressed from the host cell allow the formation 
of virus particles, into which the RNA of the HCV 
subgenomic constructs will be transfected. The results are 
therefore virus-like particles, which contain a HCV 
subgenomic construct according to the invention including 

35 the inserted foreign gene and which can transmit this to 
other cells by means of infection. An example for this 
construct is illustrated in Fig. 8. 
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It is also possible to use the HCV subgenomic construct 
with integrated foreign gene directly as an expression 
vector. This involves the same method as mentioned 
5 previously except that cell lines, which do not express 
transcomplementing factors, are transfected. In this case 
the HCV construct is only used as an expression vector. 

10 Example 7 : Production of cell culture adapted HCV-RNA 
constructs 

(A) Method of isolation 

15 The following method was used to determine adaptive 
mutations and to produce cell culture adapted HCV-RNA 
constructs: cells were transfected with a HCV-RNA construct 
as described in Examples 1 and 2 and G418-resistant cell 
clone produced. For the determination of ability to 

20 replicate (understood in this context to be the number of 
G418 resistant cell clones obtained per microgram of 
transfected HCV-RNA or HCV-RNA construct), the total RNA 
from one of the cell clones, [the 9-13 clone (Fig. 1 B, 
lane 11)], was isolated and the quantity of HCV-RNA 

25 contained within it was determined by Northern Blot as 
described in Fig. 2 B. Ten micrograms of the total RNA, 
containing approx. 10 9 molecules of HCV-RNA, was then 
transfected into naive Huh-7 cells using electroporation 
(Fig. 9) . In parallel, 10 9 in vitro transcripts of the 

30 analoguous n eo- HC V- RNA , which had been adjusted with 
isolated total RNA from naive Huh-7 cells to a total RNA 
quantity of 10 \xq r were transfected in naive Huh-7 cells. 
After selection with G418, the number of cell colonies, 
expressed as 'colony forming units (cfu) per microgram 

35 RNA 1 , was determined in both these cultures. At a 
concentration of 500 pg/ml G418 in the selection medium, 
the number of colonies obtained with the HCV-RNA contained 
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in Isolated total RNA from clone 9-13, was approx. 100,000 
cfu per microgram HCV-RNA. In contrast, only 30 - 50 
colonies were obtained with the same quantity of in vitro 
transcribed HCV-RNA. This result confirms that the specific 
5 infectivity of the HCV-RNA isolated from the . cell clones is 
approx. 1,000 - 10,000 times higher than the infectivity of 
the analoguous in vitro transcripts. The experimental 
approach is shown in Fig. 9. 

10 With the aid of 'long-distance RT-PCR' , the HCV-RNA was 
amplified from the total RNA of the 9-13 cells, the PCR 
amplificate was cloned and numerous clones were sequenced. 
A comparison of the sequences of these recloned RNAs with 
the sequence of the RNA originally transfected into the 

15 naive Huh-7 cells, showed that the recloned RNAs possessed 
numerous amino acid exchanges distributed over the whole 
HCV sequence (Fig. 10). Sfil fragments of these recloned 
mutants were used to replace the analoguous Sfil fragment 
of the original replicon construct, and RNAs of the 

20 respective mutants were transfected in naive Huh-7 cells. 
After selection with G418 the number of colonies created 
was determined for each HCV-RNA mutant. While only 30 - 50 
colonies per microgram RNA were obtained with the parental 
RNA the number of colonies was noticeably higher for two of 

25 the recloned variants (Fig. 10) . In the case of the HCV-RNA 
constructs 9-131 and 9-13C the specific infectivity was 
increased to 100 - 1, 000*' cfu per microgram RNA and for 9- 
13F replicon it was 1,000 - 10,000 cfu per microgram RNA. 
These results show that the amino acid exchanges in the 

30 analysed NS3-5B regions of the mutants 9-131, 9-13C and 
particularly of 9-13F, led to a considerable increase in 
ability to replicate. In contrast all the other HCV-RNA 
constructs (9-13 A, B, G, H und K) were no longer able to 
replicate, they thus contained lethal mutations. 

35 

In order to answer the question which of the amino acid 
exchanges in the 9-13F construct led to an increase in 
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replication, the exchanges were introduced separately or in 
combination into the parental HCV-RNA construct, and the 
corresponding RNAs transfected in naive Huh-7 cells. The 
result of the transfection with these RNAs is summarised in 

5 Table 1. From this it is evident that in the present 
example the high ability to replicate is determined by 
several mutations. The amino acid exchanges in the HCV-RNA 
regions NS5A and NS4B make the greatest contribution. The 
single exchanges in the NS3-Region also make a contribution 

10 and perhaps they are synergistic. 

These results confirm that it was through the G418 
selection of the cells transfected with the neo— HCV-RNA 
construct that there was enrichment of those HCV-RNAs 

15 having noticeably higher ability to replicate. HCV-RNA 
constructs with greatly differing replication efficiencies 
can be selected using the experimental approach described 
here. The higher the concentration of the antibiotic in the 
selection medium, in/on which . the HCV-RNA construct 

20 containing cells are cultivated for selection, the higher 
must be the extent of adaptive mutations and hence 
replication efficiency of the relevant HCV-RNA constructs, 
to allow the cells to grow under these conditions. If the 
selections are carried out using lower antibiotic 

25 concentrations, cells can survive and multiply, but the 
HCV-RNA construct shows a comparatively lower replication 
efficiency and fewer adaptive mutations. 

As has been shown, the 9-13F HCV-RNA construct described so 
30 far, which contains several adaptive mutations, had a 
higher replication efficiency than the parental HCV-RNA. In 
order to obtain HCV-RNAs with even higher replication in 
cell culture, the HCV-RNA contained in the total RNA of a 
selected cell clone was passaged several times in naive 
35 Huh-7 cells. The selected 5-15 cell clone, was obtained by 
transfection with the HCV-RNA construct I 3 8 9 /NS3-3 r (Fig. 
1). It largely corresponds to the cell clone 9-13, produced 
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by transfection with a HCV-RNA construct, having a HCV-IRES 
shorter by 22 nucleotides ( I 37 7/NS3-3 ' ; Fig. 1). Ten 
micrograms of total RNA, isolated from cell clone 5-15, 
were transfected into naive Huh-7 cells using 
5 electroporation and the cells subjected to a selection with 
1 mg/ml G418. The total RNA from one of the cell clones 
thus produced was again isolated, transfected into naive 
Huh-7 cells and selected in the same way. This process was 
repeated a total of four times. After the fourth passage 

10 the total RNA was isolated from a cell clone and the neo- 
HCV-RNA amplified with the aid of the f long-distance RT- 
PCR' . The amplified DNA fragment was digested with the 
restriction enzyme S^il and inserted into the Sfil- 
restricted parental construct I 389 /NS3-3'. Over 100 DNA 

15 clones were obtained altogether and then analysed by means 
of restriction digestion. In vitro transcribed RNA of about 
80 of these clones was each transfected into naive Huh-7 
cells and subjected to a selection with 500mg/ml G418. Of 
the 80 neo -HCV-RNA variants examined, the great majority 

20 proved to be replication defective. However, the specific 
infectivity, expressed as 'colony forming units' per 
microgram RNA, was noticeably increased in the case of two 
mutants, 5.1 and 19 (Table 2). Through several passages of 
the RNA in cell culture it is clear that HCV-RNAs are 

25 produced whose replication efficiency due to mutations 
(^adaptive mutations") is several orders of magnitude 
higher than the original RNA cloned from patients. 

(B) Modified method 

30 

Adaptive mutations produced and identified according to (A) 
can be transferred into a HCV-RNA construct with low 
ability to replicate. This leads to a huge increase in the 
replication of this construct. The increase is so great it 
35 can be demonstrated that HCV-RNAs transfected into cell 
culture can replicate even in the absence of selection 
pressure Fig. 12 shows a comparison of the replication 
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efficiency of HCV-RNAs, which corresponded either to the 
starting sequence or to the adaptive sequences 9-13F or 
5.1. For the purposes of simple measurement, the neo-gene 
was removed and replaced by the gene for luciferase. The 
5 negative control used was again a HCV-RNA construct that 
was replication defective due to a deactivating mutation in 
the NS5B RNA polymerase. Already 24 hours after 
transfection a noticeable difference is evident in 
luciferase activity between the defective RNA and the 9-13F 

10 or 5.1 constructs , while hardly any difference could be 
seen between the defective RNA (318 DN) and the parental 
RNA construct (wt) that possessed no adaptive mutations. 
During the whole period of observation, the highest 
luciferase activity, and thus highest replication, was 

15 obtained with the 5.1-RNA. These results not only confirm 
the high replication efficiency of this RNA, but also show 
that it is possible to create a cell culture system with 
adapted HCV-RNA constructs for which the presence of a 
selectable gene is no longer necessary. A summary of the 

20 nucleotide and amino acid differences between the starting 
construct and the mutants 9-13F, 5.1 and 19 is presented in 
Table 3. 

25 Example 8: Production of call culture adapted HCV-RNA 

full-length genome 

In the examples 1 to 7 a subgenomic HCV-RNA was used which 
lacked the whole structural protein region from core up to 

30 p7 or even NS2. It will be shown in this example that it is 
possible to make a HCV full-length genome replicate in cell 
culture with the aid of an adapted NS3-5B sequence. For 
this purpose the Sfil fragment of the highly adapted HCV- 
RNA 5 . 1 produced according to Example 7 is first 

35 transferred into a selectable HCV full-length genome (Fig. 
12) . This HCV genome was transfected into naive Huh-7 cells 
and subjected to selection with various G418 
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concentrations. Depending on the strength of selection (the 
G418 concentration) r a varying large number of cell clones 
was obtained (Fig. 12 B) . By contrast no colonies were 
obtained with the parental HCV full-length genome 
5 containing no adaptive mutations, as was the case for the 
negative control, which was replication defective due to a 
deactivating muration in the NS5B RNA polymerase. To 
confirm that the thus resulting cell clones really 
contained an autonomously replicating HCV full-length 

10 construct, total RNA from several cell clones was isolated 
and analysed by means of the Northern Blot method. The 
full-length HCV-RNA was clearly detectable in all cell 
clones (Fig. 12) . It is thus clearly confirmed, that with 
the aid of cell culture adapted HCV sequences it is 

15 possible to produce a HCV full-length genome, which 
replicates highly efficiently and autonomously in a cell 
line, i.e. adapted HCV full-length genomes can also be 
produced with the system of the invention. Furthermore, as 
this clone possesses the complete HCV sequence, i.e. it 

20 also possesses the structural proteins necessary for virus 
particle formation, it is possible to produce large 
quantities of infectious virus particles in cell cultures 
with this system. As a confirmation of these viruses, cell- 
free cell supernatants carrying a replicating HCV full- 

25 length genome, are added to naive Huh-7 cells and the thus 
infected cells subjected to selection with G418. Each cell 
clone growing under these conditions originates . from an 
infected cell. The viruses in the cell culture supernatant 
of cells possessing a replicating HCV full-length genome 

30 can be enriched and purified using various known state of 
the art methods such as ultracentrifugation or 
microdialysis. They can then be used for the infection of 
naive cells. Using this method it is clearly demonstrated 
that cell culture adapted full-length genomes can be 

35 produced with the HCV cell culture system of the invention. 
These genomes replicate with high efficiency in cells and 
produce infectious viruses. The latter can be detected by 
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infection of an experimental animal, preferably a 
chimpanzee. 

5 Example 9 : Production of HCV full-length constructs and 
HCV subgenomic constructs with reporter genes 

A HCV-RNA construct is produced in which a reporter gene is 
inserted in place of the antibiotic resistance gene (Fig. 

10 13) . Replication can thereby be determined through the 
quantity or activity of the reporter gene or reporter gene 
product. The reporter gene is preferably a gene from the 
group of the luciferase genes, the CAT gene 
(chloramphenicol acetyl transferase gene) , the lacZ gene 

15 (beta galactosidase gene) , the GFP gene (green fluorescence 
protein gene) , the GUS gene (glucuronidase gene) or the 
SEAP gene (secreted alkaline phosphatase gene) «. This 
reporter gene and its products, namely the relevant 
reporter proteins, can be detected for example using 

20 fluorescence, chemiluirtinescence, colorimetrically or by 
means of immunological methods (for example, enzyme-linked 
immunosorbent assay, ELISA) . 

The reporter gene can be expressed either from a separate 
IRES or in the form of a fusion protein, which is active 
25 either as such or fused with a HCV protein via a 
poteolytically cleavable amino acid sequence in such a way 
that the reporter is separated from the HCV protein by 
cleavage of a cellular or viral (HCV) protease, 

30 

Example 10 : Production of HCV full-length constructs with 
integrated foreign genes used as liver cell 
specific vectors for gene therapy or as 
expression vectors. 

35 

The construct (Fig. 14) is transfected in cells and leads 
to the formation of HCV virus particles that can be used 



CA 02303526 2000*03-31 

- 60 - 

for the infection of further cells. Since the virus 
particles have encapsidated RNA with a foreign gene, it can 
be used in the infected cells for the production of the 
protein coded by this foreign gene. Cells transfected with 
5 the construct also express the foreign gene. 

Example 11 : Production of monocistxonic HCV-FNA constructs 
in which the resistance gene product is 
10 expressed as a fusion protein with the HCV 

portion . 

It is an advantage for some tests if the HCV- RNA construct 
does not possess a heterologous IRES element. Tests of this 

15 type are, for example, the determination of interferon 
resistance. If a cell possessing a HCV -RNA construct is 
incubated with interferon alpha or beta, a reduction in 
replication of the HCV-RNA results. In order to explain the 
mechanism of this effect it is necessary for the HCV-RNA 

20 construct not to possess any heterologous IRES, as 
otherwise it is not possible to determine whether the 
interferon mediated inhibition is via inhibition of the HCV 
replication or inhibition of the heterologous IRES. For 
this reason constructs are produced for which the 

25 resistance gene is fused with a HCV protein (Fig. 15), 
Either the fusion protein is active as such or the 
resistance gene product "is linked to a HCV protein via a 
proteolytically cleavable . amino acid sequence in such a way 
that it is separated from the HCV protein by a cellular or 

30 viral (HCV) protease. 
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Table 1: Specific inf ect i vit ies (cfu/pg RNA) of HCV RNA 
constructs with adaptive mutations found with the 9-13F 
mutant introduced into the parental construct I 3 89/NS3-3 1 /wt 



amino acid exchange 


HCV protein 


cfu/pg RNA^ 


none 




30-60 


1283 arg -> gly 


NS3 


200 - 250 i 


1383 glu -> ala 


NS3 


30 - 60 


1577 lys -> arg 


NS3 


30 - 60 


1609 lys -> glu 


M O *3 


J.DU JUU 


{1283 arg -> gly + 1383 glu - 
> ala + 1577 lys -> arg + 
1609 lys -> glu) 


NS3 


360 - 420 


1936 pro -> ser 


NS4B 


1000-5000 


2163 glu -> gly 


NS5A 


1000-5000 


2330 lys -> glu 


NS5A 


30-60 1 


2442 ile -> val 


NS5A 


30 - 60 


all together 




5000 



1 amino acid change in the polyprotein of the HCV isolate 
con 1 (EMBL-gene bank No. AJ238799) ; amino acids are given 
in single letter code. 

2 Colony forming units (number of cell clones) obtained 
10 with a selection of .500 pg/ml G418. 
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Table 2: Specific infect ivities (cfu/pg RNA) of the 
parental HCV RNA construct I 3 89/NS3-3 ' /wt and the variants 

9-131, 9-13F, 5.1 and 19. 



Transfected RNA variant 


cfu/pg RNA 1 


wild type 


30 - 50 


9-13 I 


100 - 1.000 : 


9-13 F 


1.000 - 10.000 


5.1 


50.000 - 100.000 


19 


50.000 - 100.000 



1 Colony forming units (number of cell clones) obtained 
with a selection of 500yg/ml G418. 
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Table 3: Nucleotide and amino acid sequence differences 
between the parental HCV RNA construct I 3B 9/NS3-3 ' /wt and 
the mutants 9-131, 9-13F, 5.1 und 19 



Clone 


nt-position 


nt-exchange 


aa-exchange 


9-13 I 


3685 


C > T 


P > L 




4933 


C > T 


T > M 




5249 


T > C 







8486 


C > T 






8821 


G > A 


W > stop 




8991 


C > G 


R > G 




9203 


A > G 







9313 


T > C 


F > S 




9346 


T > C 


V > A j 


9-13 F 


3866 


C > T 







4188 


A > G 


R > G 




4489 


A > C 


E > A 




4562 


G > A 






4983 


T > C 


— i 




5071 


A > G 


K > R 




5166 


A > G 


K > E 




6147 


C > T 


P > S 




6829 


A > G 


E > G 




7329 


A > G 


K > E 




7 664 


A > G 


I > V 




8486 


C > T 






8991 


C > G 


R > G 


NK5. 1 


4180 


C > T | 


T > I 




4679 


C > T 






4682 


T > C 






5610 


C > A 


L > I 




6437 


A > G 






6666 


A > G 


N > D 
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6842 


C > T 






6926 


C > T 






6930 


T > C 


S > P 




7320 


C > T 


P > s 




7389 


A > G 


K > E 


NK19 


3946 


A > G 


E > G 

• 




4078 


C > G 


A > G 




4180 


C > T 


T > I 




4682 


T > C 






5610 


C > A 


L > I 




5958 


A > T 


M > L 




6170 


T > A 






6596 


G > A 






6598 


C > G 


A > G 




6833 


C > T 






6842 j 


C > T 






6930 


T > C 


S > P 




7141 


A > G 


E > G 




7320 


C > T 


P > S 




7389 


A > G 


K > E 




7735 


G > A 


S > N 



Given are the differences between the nucleotide and amino 
acid sequences of the parental HCV RNA sequence con 1 
(EMBL-gene bank No. AJ238799) and those of the cell culture 
adapted HCV RNAs . Numbers refer to the nucleotide and amino 
acid positions of the con 1 isolate, nt, nucleotide; aa, 
amino acid. 
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SEQUENCE LISTING 

<110> Bartenschlager, Ralf 

<120> Hepatitis C Virus cell culture systen 

<130> ba-1 

<140> 199 15 178.4 
<141> 1999-04-03 

<160> 11 

<17C> Parentln Ver. 2.1 

<210> 1 
<211> 11076 
<212> DNA 

<213> Hepatitis C Virus 
<400> 1 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg t^agtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct rgggtggaga ggctattcgg ctatgactgg gcacaacaga 4 80 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc .ggaacccccc acctggcgacf'aggtgcctct gcggccaaaa ' gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatgggcac gaatcctaaa cctcaaagaa aaaccaaacg taacaccaac 1860 
cgccgcccac aggacgtcaa gttcccgggc ggtggtcaga tcgtcggtgg agtttacctg 1920 
ttgccgcgca ggggccccag gttgggtgtg cgcgcgacta ggaagacttc cgagcggtcg 1980 
caacctcgtg gaaggcgaca acctatcccc aaggctcgcc agcccgaggg tagggcctgg 2040 
gctcagcccg ggtacccctg gcccctctat ggcaatgagg gcttggggtg ggcaggatgg 2100 
ctcctgtcac cccgtggctc tcggcctagt tggggcccca cggacccccg gcgtaggtcg 2160 
cgcaatttgg gtaaggtcat cgataccctc acgtgcggct tcgccgatct catggggtac 2220 
attccgctcg tcggcgcccc cctagggggc gctgccaggg ccctggcgca tggcgtccgg 2280 
gttctggagg acggcgtgaa ctatgcaaca gggaatctgc ccggttgctc cttttctatc 2340 
ttccttttgg ctttgctgtc ctgtttgacc atcccagctt ccgcttatga agtgcgcaac 2400 
gtatccggag tgtaccatgt cacgaacgac tgctccaacg caagcattgt gtatgaggca 2460 
gcggacatga tcatgcatac ccccgggtgc gtgccctgcg ttcgggagaa caactcctcc 2520 
cgctgctggg tagcgctcac tcccacgctc gcggccagga acgctagcgt ccccactacg 2580 
acgatacgac gccatgtcga tttgctcgtt ggggcggctg ctctctgctc cgctatgtac 2640 
gtgggagatc tctgcggatc tgttttcctc gtcgcccagc tgttcacctt ctcgcctcgc 2700 
cggcacgaga cagtacagga ctgcaattgc tcaatatatc ccggccacgt gacaggtcac 2760 
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cgtatggctt gggatatgat gatgaactgg tcacctacag cagccctagt ggtatcgcag 2820 
ttactccgga tcccacaagc tgtcgtggat atggtggcgg gggcccattg gggagtccta 2880 
gcgggccttg cctactattc catggtgggg aactgggcta aggttctgat tgtgatgcta 2940 
ctctttgccg gcgttgacgg gggaacctat g^gacagggg ggacgatggc caaaaacacc 3000 
ctcgggatta cgtccctctt ttcacccggg tcatcccaga aaatccagct tgtaaacacc 3060 
aacggcagct ggcacatcaa caggactgcc crgaactgca atgactccct caacactggg 3120 
ttccttgctg cgcrtgttcta cgtgcacaag ttcaactcat ctggatgccc agagcgcatg 3180 
gccagctgca gccccatcga cgcgttcgct caggggtggg ggcccatcac ttacaatgag 3240 
tcacacagct cggaccagag gccttattgt tggcactacg caccccggcc gtgcggtatc 3300 
gtacccgcgg cgcaggtgtg tggrccagtg tactgcttca ccccaagccc tgtcgtggtg 3360 
gggacgaccg accggttcgg cgtccctacg tacagttggg gggagaatga gacggacgtg 3420 
crgcttctta acaacacgcg gccgccgcaa ggcaactggt ttggctgtac atggatgaat 3480 
agcactgggt tcaccaagac gtgcgggggc cccccgtgta acatcggggg gatcggcaat 354 0 
aaaaccttga cctgccccac cgactgcttc cggaagcacc ccgaggccac ttacaccaag 3600 
tgtggttcgg ggccttggtt gacacccaga tgcttggtcc actacccata caggctttgg 3660 
cactacccct gcactgtcaa ctttaccatc ttcaaggtta ggatgtacgt ggggggagtg 3720 
gagcacaggc tcgaagccgc atgcaattgg actcgaggag agcgttgtaa cctggaggac 3780 
agggacagat cagagcttag cccgctgctg ctgtctacaa cggagtggca ggtattgccc 3840 
tgttccttca ccaccctacc ggctctgtcc actggtttga tccatctcca tcagaacgtc 3900 
gtggacgtac aatacctgta cg.gtataggg tcggcggttg tctcctttgc aatcaaatgg 3960 
gagtatgtcc tgttgctctt ccttcttctg gcggacgcgc gcgtctgtgc ctgcttgtgg 4020 
atgatgctgc tgatagctca agctgaggcc gccctagaga acctggtggt cctcaacgcg 4080 
gcatccgtgg ccggggcgca tggcattctc tccttcctcg tgttcttctg tgctgcctgg 4140 
tacatcaagg gcaggctggt ccctggggcg gcatatgccc tctacggcg- atggccgcta 4200 
ctcctgctcc tgctggcgtt accaccacga gcatacgcca tggaccggga gatggcagca 42 60 
tcgtgcggag gcgcggtttt cgtaggtctg atactcttga ccttgtcacc gcactataag 4320 
ctgttcctcg . ctagcctcat atggtggtta caatatttta tcaccagggc cgaggcacac 4380 
ttgcaagtgt ggatcccccc cctcaacgtt cgggggggcc gcgatgccgt catcctcctc 4440 
acgtgcgcga tccacccaga gctaatcttt accatcacca aaatcttgct cgccatactc 4500 
ggtccactca tggtgctcca ggcrggtata accaaagtgc cgtacttcgt gcgcgcacac 4560 
gggctcattc gtgcatgcat gctggtgcgg aaggttgctg ggggtcatta tgtccaaatg 4 620 
gctctcatga agttggccgc actgacaggt acgtacgttt atgaccatct caccccactg 4680 
cgggactggg cccacgcggg cctacgagac cttgcggtgg cagttgagcc cgtcgtcttc 4740 
tctgatatgg agaccaaggt tatcacctgg ggggcagaca ccgcggcgtg tggggacatc 4800 
atcttgggcc tgcccgtctc cgcccgcagg gggagggaga tacatctggg accggcagac 4 860 
agccttgaag ggcaggggtg gcgactcctc gcgcctatta cggcctactc ccaacagacg 4 920 
cgaggcctac ttggctgcat catcactagc ctcacaggcc gggacaggaa ccaggtcgag 4 980 
ggggaggtcc aagtggtctc caccgcaaca caatctttcc tggcgacctg cgtcaatggc 504 0 
gtgtgttgga ctgtctatca tggtgccggc tcaaagaccc ttgccggccc aaagggccca 5100 
atcacccaaa tgtacaccaa tgtggaccag gacctcgtcg gctggcaagc gccccccggg 5160 
gcgcgttcct tgacaccatg cacctgcggc agctcggacc tttacttggt cacgaggcat 5220 
gccgatgtca ttccggtgcg ccggcggggc gacagcaggg ggagcctact ctcccccagg 5280 
cccgtctcct acttgaaggg ctcttcgggc ggtccactgc tctgcccctc ggggcacgct 5340 
gtgggcatct ttcgggctgc cgtgtgcacc cgaggggttg cgaaggcggt ggactttgta 5400 
cccgtcgagt ctatggaaac cactatgcgg tccccggtct tcacggacaa ctcgtcccct 54 60 
ccggccgtac cgcagacatt ccaggtggcc catctacacg cccctactgg tagcggcaag 5520 
agcactaagg tgccggctgc gt atgcagccr'caagggtata aggtgcttgt * cctgaacccg 5580 
tccgtcgccg ccaccctagg tttcggggcg tatatgtcta aggcacatgg tatcgaccct 5640 
aacatcagaa ccggggtaag gaccatcacc acgggtgccc ccatcacgta ctccacctat 5700 
ggcaagtttc ttgccgacgg tggttgctct gggggcgcct atgacatcat aatatgtgat 5760 
gagtgccact caactgactc gaccactatc ctgggcatcg gcacagtcct ggaccaagcg 5820 
gagacggctg gagcgcgact cgtcgtgctc gccaccgcta cgcctccggg atcggtcacc 5880 
gtgccacatc caaacatcga ggaggtggct ctgtccagca ctggagaaat ccccttttat 594 0 
ggcaaagcca tccccatcga gaccatcaag ggggggaggc acctcatttt ctgccattcc 6000 
aagaagaaat gtgatgagct cgccgcgaag ctgtccggcc tcggactcaa tgctgtagca 6060 
tattaccggg gccttgatgt atccgtcata ccaactagcg gagacgtcat tgtcgtagca 6120 
acggacgctc taatgacggg ctttaccggc gatttcgact cagtgatcga ctgcaataca 6180 
tgtgtcaccc agacagtcga cttcagcctg gacccgacct tcaccattga gacgacgacc 6240 
gtgccacaag acgcggtgtc acgctcgcag cggcgaggca ggactggtag gggcaggatg 6300 
ggcatttaca ggtttgtgac tccaggagaa cggccctcgg gcatgttcga ttcctcggtt 6360 
ctgtgcgagt gctatgacgc gggctgtgct tggtacgagc tcacgcccgc cgagacctca 6420 
gttaggttgc gggcttacct aaacacacca gggttgcccg tctgccagga ccatctggag 64 80 
ttctgggaga gcgtctttac aggcctcacc cacatagacg cccatttctt gtcccagact 6540 
aagcaggcag gagacaactt cccctacctg gtagcatacc aggctacggt gtgcgccagg . 6600 
gctcaggctc cacctccatc gtgggaccaa atgtggaagt gtctcatacg gctaaagcct 6660 
acgctgcacg ggccaacgcc cctgctgtat aggctgggag ccgttcaaaa cgaggttact 6720 
accacacacc ccataaccaa atacatcatg gcatgcatgt cggctgacct ggaggtcgtc 6780 
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acgagcacct 
acaggcagcg 
gacagggaag 
t acatcgaac 
caaacagcca 
ctcgaagcct 
ggcttgtcca 
atcaccagcc 
gccgcccaac 
gcggctgttg 
gcaggggtgg 
gaggacctgg 
gtgtgcgcag 
aaccggctga 
gagagcgacg 
ctgaagaggc 
ctaagagarg 
tccaagctcc 
ggagtctggc 
ggacatgtga 
catggaacat 
aattattcta 
ggggatttcc 
ccggcccccg 
tgcaaacccc 
gggtcacagc 
gacccctccc 
tccttggcca 
acccgtcatg 
. atgggcggga 
gagccgctcc 
aggtccagga 
ctgttagagt 
ccgcctgcca 
gaatctaccg 
tcgtcggccg 
gacgcgggat 
gatcccgatc 
gtctgctgct 
gaaaccaagc 
tatgctacaa 
caggtcctgg 
gttaaggcta 
agatctaaat 
cacatccgct 
atcatggcaa 
cgccttatcg 
gtggtctcca 
ggacagcggg 
gcatatgaca 
tcaatctacc 
gagcggcttt 
cggtgccgcg 
aaggccgctg 
gacgaccttg 
gccttcacgg 
tacgacttgg 
ggcaaaaggg 
gagacagcta 
accttgtggg 
caacttgaaa 
gacctacctc 
tctccaggtg 
cgagtctgga 
gctgccactt 
ccaatcccgg 
ggagacatat 



gggtgctggt 

tggtcattgt 

tcctttaccg 

agggaatgca 

ccaagcaagc 

tctgggcgaa 

ctctgcctgg 

cgctcaccac 

ttgctcctcc 

gcagcatagg 

caggcgcgct 

ttaacctact 

cgatactgcg 

tagcgttcgc 

ctgcagcacg 

ttcaccagtg 

tttgggattg 

tgccgcgatt 

ggggcgacgg 

aaaacggttc 

tccccattaa 

gggcgctgtg 

actacgtgac 

aattcttcac 

tcctacggga 

tcccatgcga 

acattacggc 

gctcatcagc 

actccccgga 

acatcacccg 

aagcggagga 

aattccctcg 

cctggaagga 

aggcccctcc 

tgtcttctgc 

tcgacagcgg 

ccgacgttga 

tcagcgacgg 

cgatgtccta 

tgcccatcaa 

catctcgcag 

acgaccacta 

aacttctatc 

ttggctatgg 

ccgtgtggaa 

aaaatgaggt 

tattcccaga 

ccctccctca 

tcgagttcct 

cccgctgttt 

aatgttgtga 

acatcggggg 

cgagcggtgt 

cggcctgtcg 

tcgttatctg 

aggctatgac 

agttgataac 

tgtactatct 

gacacactcc 

caaggatgat 

aagccctaga 

agatcattca 

agatcaatag 

gacatcgggc 

gtggcaagta 

ctgcgtccca 

atcacagcct 



aggcggagtc 

gggcaggatc 

ggagttcgat 

gctcgccgaa 

ggaggctgct 

gcatatgtgg 

caaccccgcg 

ccaacatacc 

cagcgctgct 

ccttgggaag 

cgtggccttt 

ccctgctatc 

tcggcacctg 

ttcgcggggt 

tgtcactcag 

gatcaacgag 

gatatgcacg 

gccgggagtc 

catcatgcaa 

catgaggatc 

cgcgtacacc 

gcgggtggct 

gggcatgacc 

agaagtggat 

ggaggtcaca 

gcccgaaccg 

ggagacggct 

tagccagctg 

cgctgacctc 

cgtggagtca 

ggatgagagg 

agcgatgccc • 

cccggactac 

gataccacct 

cttggcggag 

cacggcaacg 

gtcgtactcc 

gtcttggtct 

cacatggaca 

tgcactgagc 

cgcaagcctg 

ccgggacgtg 

cgtggaggaa 

ggcaaaggac 

ggacttgctg 

tttctgcgtc 

tttgggggtf 

ggccgtgatg 

ggtgaatgcc 

tgactcaacg 

cttggccccc 

ccccctgact 

actgacgacc 

agctgcgaag 

tgaaagcgcg 

tagatactct 

atcatgctcc 

cacccgtgac 

agtcaattcc 

cctgatgact 

ttgtcagatc 

acgactccat 

ggtggcttca 

cagaagtgtc 

cctcttcaac 

gttggattta 

gtctcgtgcc 



ctagcagctc 

atcttgtccg 

gagatggaag 

caattcaaac 

gctcccgtgg 

aatttcatca 

atagcatcac 

ctcctgttta 

tctgctttcg 

gtgcttgtgg 

aaggtcatga 

ctctcccctg 

ggcccagggg 

aaccacgtct 

atcctctcta 

gactgctcca 

gtgttgactg 

cccttcttct 

accacctgcc 

gtggggccta 

acgcgcccct 

gctgaggagt 

actgacaacg 

ggggtgcggt 

ttcctggtcg 

gacgtagcag 

aagcgtaggc 

tctgcgcctt 

atcgaggcca 

gaaaataagg 

gaagtatccg 

atatgggcac 

gtccctccag 

ccacggagga 

ctcgccacaa 

gcctctcctg 

tccatgcccc 

accgtaagcg 

ggcgccctga 

aactctttgc 

cggcagaaga 

ctcaaggaga 

gcctgtaagc 

gtccggaacc 

gaagacactg 

caaccagaga 

cgtgtgtgcg 

ggctcttcat 

tggaaagcga 

gtcactgaga 

gaagccagac 

aattctaaag 

agctgcggta 

ctccaggact 

gggacccaag 

gccccccctg 

tccaatgtgt 

cccaccaccc 

tggctaggca 

catttcttct 

tacggggcct 

ggccttagcg 

tgcctcagga 

cgcgctaggc 

tgggcagtaa 

tccagctggt 

cgaccccgct 



tggccgcgta 
gaaagccggc 
agtgcgcctc 
agaaggcaat 
tggaatccaa 
gcgggataca 
tgatggcatt 
acatcctggg 
taggcgccgg 
atattttggc 
gcggcgagat 
gcgccctagt 

agggggctgt 

cccccacgca 

gtcttaccat 

cgccatgctc 

atttcaagac 

catgtcaacg 

catgtggagc 

ggacctgtag 

gcacgccctc 

acgtggaggt 

taaagtgccc 

tgcacaggta 

ggctcaatca 

tgctcacttc 

tggccagggg 

ccttgaaggc 

acctcctgtg 

tagtaatttt 

ttccggcgga 

gcccggatta 

tggtacacgg 

agaggacggt 

agaccttcgg 

accagccctc 

cccttgaggg 

aggaggctag 

tcacgccatg 

tccgtcacca 

aggtcacctt 

tgaaggcgaa 

tgacgccccc 

tatccagcaa 

agacaccaat 

aggggggccg 

agaaaatggc 

acggattcca 

agaaatgccc 

atgacatccg 

aggccataag 

ggcagaactg 

ataccctcac 

gcacgatgct 

aggacgaggc 

gggacccgcc 

cagtcgcgca 

cccttgcgcg 

acatcatcat 

ccatccttct 

gttactccat 

cattttcact 

aacttggggt. 

tactgtccca 

ggaccaagct 

tcgttgctgg 

ggttcatgtg 



ttgcctgaca 

catcattcoc 

acacctccct 

cgggttgctg 

ctggcggacc 

atatttagca 

cacagcctct 

gggatgggtg 

catcgctgga 

aggttatgga 

gccctccacc 

cgtcggggtc 

gcagtggatg 

ctatgtgcct 

cactcagctg 

cggctcgzgg 

ctggctccag 

tgggtacaag 

acagatcacc 

taacacgtgg 

cccggcgcca 

tacgcgggtg 

gtgtcaggtt 

cgctccagcg 

atacctggtt 

catgctcacc 

atctcccccc 

aacatgcact 

gcggcaggag 

ggactctttc 

gatcctgcgg 

caaccctcca 

gtgtccattg 

tgtcctgtca 

cagctccgaa 

cgacgacggc 

ggagccgggg 

tgaggacgtc 

cgctgcggag 

caacttggtc 

tgacagactg 

ggcgtccaca 

acattcggcc 

ggccgttaac 

tgacaccacc 

caagccagct 

cctttacgat 

atactctcct 

tatgggcttc 

tgttgaggag 

gtcgctcaca 

cggctatcgc 

atgttacttg 

cgtatgcgga 

gagcctacgg 

caaaccagaa 

cgatgcatct 

ggctgcgtgg 

gtatgcgccc 

agctcaggaa 

tgagccactt 

ccatagttac 

accgcccttg 

gggggggagg 
caaactcact 
ttacagcggg 
gtgcctactc 



6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

768C 

7740 

7300 . 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 
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ctactttctg taggggtagg catctatcta ctccccaacc gatgaacggg gagctaaaca 10860 

ctccaggcca ataggccatc ctgttttttt cccttttttt ttttcttttt tttttttttt 10920 

tttttttttt tttttttttc tccttttttt . ttcctctttt tttccttttc tttcctttgg 10980 

tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc 11040 

agagagtgct gatactggcc tctctgcaga tcaagt 11076 
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<210> 2 
<211> 8637 
<212> DNA 

<213> Hepatitis C Virus 
<400> 2 

gccagccccc gattgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccczccc gggagagcca tagtggtctg 
gacgaccggg tcctttcttg gatcaacccg 
gcgagactgc tagccgagta gtgttgggtc 
gtgcttgcga gtgccccggg aggtctcgta 
ctcaaagaaa aaccaaaggg cgcgccatga 
cggccgcttg ggtggagagg crattcggct 
ctgatgccgc cgtgttccgg c-gtcagcgc 
acctgtccgg tgccctgaat gaactgcagg 
cgacgggcgt tccttgcgca gctgtgctcg 
tgctattggg cgaagtgccg gggcaggatc 
aagtatccat catggctgat gcaatgcggc 
cattcgacca ccaagcgaaa catcgcatcg 
ttgtcgatca ggatgatctg gacgaagagc 
ccaggctcaa ggcgcgcatg cccgacggcg 
gcttgccgaa tatcatggtg gaaaatggcc 
tgggtgtggc ggaccgctat caggacatag 
ttggcggcga atgggctgac cgcttcctcg 
agcgcatcgc cttctatcgc cttcttgacg 
gtttccctct agcgggatca attccgcccc 
cgaagccgct tggaataagg ccggtgtgcg 
ccgtcttttg gcaatgtgag ggcccggaaa 
aggggtcttt cccctctcgc caaaggaatg 
gttcctctgg aagcttcttg aagacaaaca 
aaccccccac ctggcgacag gtgcctctgc 
gcaaaggcgg cacaacccca gtgccacgtt 
tggctctcct caagcgtatt caacaagggg 
atgggatctg atctggggcc tcggtgcaca 
aacgtctagg ccccccgaac cacggggacg 
atggaccggg agatggcagc atcgtgcgga 
accttgtcac cgcactataa gctgttcctc 
atcaccaggg ccgaggcaca cttgcaagtg 
cgcgatgccg tcatcctcct cacgtgcgcg 
aaaatcttgc tcgccatact cggtccactc 
ccgtacttcg tgcgcgcaca cgggctcatt 
gggggtcatt atgtccaaat ggctctcatg 
tatgaccatc tcaccccact gcgggactgg 
gcagttgagc ccgtcgtctt ctctgatatg 
accgcggcgt gtggggacat catcttgggc 
atacatctgg gaccggcaga cagccttgaa 
acggcctact cccaacagac gcgaggccta 
cgggacagga accaggtcga gggggaggtc 
ctggcgacct gcgtcaatgg cgtgtgttgg 
cttgccggcc caaagggccc aatcacccaa 
ggctggcaag cgccccccgg ggcgcgttcc 
ctttacttgg tcacgaggca tgccgatgtc 
gggagcctac tctcccccag gcccgtctcc 
ctctgcccct cggggcacgc tgtgggcatc 
gcgaaggcgg tggactttgt acccgtcgag 
ttcacggaca actcgtcccc tccggccgta 
gcccctactg gtagcggcaa gagcactaag 
aaggtgcttg tcctgaaccc gtccgtcgcc 
aaggcacatg gtatcgaccc taacatcaga 
cccatcacgt actccaccta tggcaagttt 
tatgacatca taatatgtga tgagtgccac 
ggcacagtcc tggaccaagc ggagacggct 
acgcctccgg gatcggtcac cgtgccacat 
actggagaaa tcccctttta tggcaaagcc 
cacctcattt tctgccattc caagaagaaa 



catagatcac tcccctgtga ggaactactg 60 
ttagzatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccag 180 
ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 300 
gaccgtgcac catgagcacg aatcctaaac 360 
ttgaacaaga tggattgcac gcaggttctc 420 
atgactgggc acaacagaca atcggctcct 480 
aggggcgccc ggttcttttr gtcaagaccg 540 
acgaggcagc gcggctatcg tggctggcca 600 
acgttgtcac rgaagcggga agggactggc 660 
tcctgtcatc tcaccttgct cctgccgaga 720 
ggctgcatac gcttgatccg gctacctgcc 780 
agcgagcacg tactcggatg gaagccggtc 840 
atcaggggct cgcgccagcc gaactgttcg 900 
aggatctcgt cgtgacccat ggcgatgcct 960 
gcttttctgg attcatcgac tgtggccggc 1020 
cgttggctac ccgtgatatt gctgaagagc 1080 
tgctttacgg tatcgccgct cccgattcgc 1140 
agttcttctg agtttaaaca gaccacaacg 1200 
tctccctccc ccccccctaa cgttactggc 1260 
tttgtctata tgttatttitc caccatattg 1320 
cctggccctg tcttcttgac gagcattcct 1380 
caaggtctgt tgaatgtcgt gaaggaagca 1440 
acgtctgtag cgaccctttg caggcagcgg 1500 
ggccaaaagc cacgtgtata agatacacct 1560 
gtgagttgga tagttgtgga aagagtcaaa 1620 
ctgaaggatg cccagaaggt accccattgt 1680 
tgctttacat gtgtttagtc gaggttaaaa 1740 
tggttttcct ttgaaaaaca cgataatacc 1800 
ggcgcggttt tcctaggtct gatactcttg 1860 
gctaggctca tatggtggtt acaatatttt 1920 
tggatccccc ccctcaacgt tcgggggggc 1980 
atccacccag agctaatctt taccatcacc 2040 
atggtgctcc aggctggtat aaccaaagtg 2100 
cgtgcatgca tgctggtgcg gaaggttgct 2160 
aagttggccg cactgacagg tacgtacgtt 2220 
gcccacgcgg gcctacgaga ccttgcggtg 2280 
gagaccaagg ttatcacctg gggggcagac 2340 
ctgcccgtct ccgcccgcag ggggagggag 24 00 
gggcaggggt ggcgactcct cgcgcctatt 24 60 
cttggctgca tcatcactag cctcacaggc 2520 
caagtggtct ccaccgcaac acaatctttc 2580 
actgtctatc atggtgccgg ctcaaagacc 2640 
atgtacacca atgtggacca ggacctcgtc 2700 
ttgacaccat gcacctgcgg cagctcggac 2760 
attccggtgc gccggcgggg cgacagcagg 2820 
tacttgaagg gctcttcggg cggtccactg 2880 
tttcgggctg ccgtgtgcac ccgaggggtt 2940 
tctatggaaa ccactatgcg gtccccggtc 3000 
ccgcagacat tccaggtggc ccatctacac 3060 
gtgccggctg cgtatgcagc ccaagggtat 3120 
gccaccctag gtttcggggc gtatatgtct 3180 
accggggtaa ggaccatcac cacgggtgcc 3240 
cttgccgacg gtggttgctc tgggggcgcc 3300 
tcaactgact cgaccactat cctgggcatc 3360 
ggagcgcgac tcgtcgtgct cgccaccgct 3420 
ccaaacatcg aggaggtggc tctgtccagc 3480 
atccccatcg agaccatcaa gggggggagg 3540 
tgtgatgagc tcgccgcgaa gctgtccggc 3600 
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ctcggactca 
ggagacgt ca 
tcagtgatcg 
ttcaccattg 
aggactggta 
ggcatgttcg 
ctcacgcccg 
gtctgccagg 
gcccatttct 
caggctacgg 
tgtctcatac 
gccgttcaaa 
tcggctgacc 
ctggccgcgt 
ggaaagccgg 
gagtgcgcct 
cagaaggcaa 
gtggaatcca 
agcgggatac 
ctgatggcat 
aacatcctgg 
gtaggcgccg 
gatattttgg 
agcggcgaga 
ggcgccctag 
gagggggctg 
tcccccacgc 
agtcttacca 
acgccatgct 
gatttcaaga 
tcatgtcaac 
ccatgtggag 
aggacctgta 
tgcacgccct 
racgtggagg 
gtaaagtgcc 
rtgcacaggt 
gggctcaatc 
gtgctcactt 
ctggccaggg 
tccttgaagg 
aacctcctgt 
gtagtaattt 
gttccggcgg 
cgcccggatt 
gtggtacacg 
aagaggacgg 
aagaccttcg 
gaccagccct 
ccccttgagg 
gaggaggcta 
atcacgccat 
ctccgtcacc 
aaggtcacct 
atgaaggcga 
ctgacgcccc 
ctatccagca 
gagacaccaa 
aaggggggcc 
gagaaaatgg 
tacggattcc 
aagaaatgcc 
aatgacatcc 
caggccataa 
gggcagaact 
aataccctca 
tgcacgatgc 



atgctgtagc 
ttgtcgtagc 
actgcaatac 
agacgacgac 
ggggcaggat 
attcctcggt 
ccgagacctc 
accatctgga 
tgtcccagac 
tgtgcgccag 
ggctaaagcc 
acgaggttac 
tggaggtcgt 
attgcctgac 
ccatcattcc 
cacacctccc 
tcgggttgct 
agtggcggac 
aatatttagc 
tcacagcctc 
ggggatgggt 
gcatcgctgg 
caggttatgg 
tgccctccac 
tcgrcggggt 
tgcagtggat 
actatgtgcc 
tcactcagct 
ccggctcgtg 
cctggctcca 
gtgggtacaa 
cacagatcac 
gtaacacgtg 
ccccggcgcc 
ttacgcgggt 
cgtgtcaggt. 
acgctccagc 
aatacctggt 
ccatgctcac 
gatctccccc 
caacatgcac 
ggcggcagga 
tggactcttt 
agatcctgcg 
acaacccicc 
ggtgtccatt 
ttgtcctgtc 
gcagctccga 
ccgacgacgg 
gggagccggg 
gtgaggacgt 
gcgctgcgga 
acaacttggt 
ttgacagact 
aggcgtccac 
cacattcggc 
aggccgttaa 
ttgacaccac 
gcaagccagc 
ccctttacga 
aatactctcc 
ctatgggctt 
gtgttgagga 
ggtcgctcac 
gcggctatcg 
catgttactt 
tcgtatgcgg 



atattaccgg 
aacggacgct 
atgtgtcacc 
cgtgccacaa 
gggcatttac 
tctgtgcgag 
agt taggttg 
gttctgggag 
taagcaggca 
ggctcaggct 
tacgctgcac 
taccacacac 
cacgagcacc 
aacaggcagc 
cgacagggaa 
ttacatcgaa 
gcaaacagcc 
cctcgaagcc 
aggcttgtcc 
tatcaccagc 
ggccgcccaa 
agcggctgtt 
agcaggggtg 
cgaggacctg 
cgtgtgcgca 
gaaccggctg 
tgagagcgac 
gctgaagagg 
gctaagagat 
gtccaagctc 
gggagtctgg 
cggacatgtg 
gcatggaaca 
aaattattct 
gggggatttc 
tccggccccc 
gtgcaaaccc 
tgggtcacag 
cgacccctcc 
ctccttggcc 
tacccgtcat 
gatgggcggg 
cgagccgctc 
gaggtccagg 
actgttagag 
gccgcctgcc 
agaatctacc 
atcgtcggcc 
cgacgcggga 
ggatcccgat 
cgtctgctgc 
ggaaaccaag 
ctatgctaca 
gcaggtcctg 
agttaaggct 
cagatctaaa 
ccacatccgc 
catcatggca 
tcgccttatc 
tgtggtctcc 
tggacagcgg 
cgcatatgac 
gtcaatctac 
agagcggctt 
ccggtgccgc 
gaaggccgct 
agacgacctt 



ggccttgatg 
ctaatgacgg 
cagacagtcg 
gacgcggtgt 
aggtttgtga 
tgctatgacg 
cgggcttacc 
agcgtcttta 
ggagacaact 
ccacctccat 
gggccaacgc 
cccataacca 
tgggtgctgg 
gtggtcattg 
gtcctttacc 
cagggaatgc 
accaagcacg 
ttctgggcga 
actctgcctg 
ccgctcacca 
cttgctcctc 
ggcagcatag 
gcaggcgcgc 
gttaacctac 
gcgatactgc 
atagcgttcg 
gctgcagcac 
cttcaccagt 
gtttgggatt 
ctgccgcgat 
cggggcgacg 
aaaaacggtt 
ttccccatta 
agggcgctgt 
cactacgtga 
gaattcttca 
ctcctacggg 
ctcccatgcg 
cacattacgg 
agctcatcag 
gactccccgg 
aacatcaccc 
caagcggagg 
aaattccctc 
tcctggaagg 
aaggcccctc 
'gtgtcttctg 
gtcgacagcg 
tccgacgttg 
ctcagcgacg 
tcgatgtcct 
ctgcccatca 
acatctcgca 
gacgaccact 
aaacttctat 
tttggctatg 
tccgtgtgga 
aaaaatgagg 
gtattcccag 
accctccctc 
gtcgagttcc 
acccgctgtt 
caatgttgtg 
tacatcgggg 
gcgagcggtg 
gcggcctgtc 
gtcgttatct 



tatccgtcat 
gctttaccgg 
acttcagcct 
cacgctcgca 
ctccaggaga 
cgggctgtgc 
taaacacacc 
caggcctcac 
tcccctacct 
cgtgggacca 
ccctgctgta 
aatacatcat 
taggcggagt 
tgggcaggat 
gggagttcga 
agctcgccga 
cggaggctgc 
agcatatgtg 
gcaaccccgc 
cccaacatac 
ccagcgctgc 
gccttgggaa 
tcgtggcctt 
tccctgctat 
gtcggcacgt 
ctrcgcgggg 
gtgtcactca 
ggatcaacga 
ggatatgcac 
tgccgggagt 
gcatcatgca 
ccatgaggat 
acgcgtacac 
ggcgggtggc 
cgggcatgac 
cagaagtgga 
aggaggtcac 
agcccgaacc 
cggagacggc 
ctagccagct 
acgctgacct 
gcgtggagtc 
aggatgagag 
gagcgatgcc 
acccggacta 
cgataccacc 
ccttggcgga 
gcacggcaac 
agtcgtactc 
ggtcttggtc 
acacatggac 
atgcactgag 
gcgcaagcct 
accgggacgt 
ccgtggagga 
gggcaaagga 
aggacttgct 
ttttctgcgt 
atttgggggt 
aggccgtgat 
tggtgaatgc 
ttgactcaac 
acttggcccc 
gccccctgac 
tactgacgac 
gagctgcgaa 
gtgaaagcgc 



accaactagc 
cgatttcgac 
ggacccgacc 
gcggcgaggc 
acggccctcg 
ttggtacgag 
agggttgccc 
ccacatagac 
ggtagcatac 
aatgtggaag 
taggctggga 
ggcatgcatg 
cctagcagct 
catcttgtcc 
tgagatggaa 
acaat tcaaa 
tgctcccgtg 
gaatttcatc 
gatagcatca 
cctcctgttt 
ttctgctttc 
ggtgcttgtg 
taaggtcatg 
cctctcccct 
gggcccaggg 
taaccacgtc 
gatcctctct 
ggactgctcc 
ggtgttgact 
ccccttcttc 
aaccacctgc 
cgtggggcct 
cacgggcccc 
tgctgaggag 
cactgacaac 
tggggtgcgg 
attcctggtc 
ggacgtagca 
taagcgtagg 
gtctgcgcct 
catcgaggcc 
agaaaataag 
ggaagtatcc 
catatgggca 
cgtccctcca 
tccacggagg 
"gctcgccaca 
ggcctctcct 
ctccatgccc 
taccgtaagc 
aggcgccctg 
caactctttg 
gcggcagaag 
gctcaaggag 
agcctgtaag 
cgtccggaac 
ggaagacact 
ccaaccagag 
tcgtgtgtgc 
gggctcttca 
ctggaaagcg 
ggtcactgag 
cgaagccaga 
taattctaaa 
cagctgcggt 
gctccaggac 
ggggacccaa 



3660 
3720 
37B0 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 



CA 02303526 2000-03-31 



gaggacgagg cgagcctacg ggccttcacg gaggctatga ctagatactc tgccccccct 7 680 

ggggacccgc ccaaaccaga atacgacttg gagttgataa catcatgctc ctccaatgtg 7740 

rcagtcgcgc acgatgcatc tggcaaaagg gtgtactatc tcacccgtga ccccaccacc 7800 

ccccttgcgc gggctgcgtg ggagacagct agacacactc cagtcaattc ctggctaggc 7860 

aacatcatca tgtatgcgcc caccttgtgg gcaaggatga tcctgatgac tcatttcttc 7 920 

tccatccttc tagctcagga acaacttgaa aaagccctag attgtcagat ctacggggcc 7980 

tgttactcca ttgagccact tgacctacct cagatcattc aacgactcca tggccttagc 8040 

gcattttcac tccatagtta ctctccaggt gagatcaata gggtggcttc atgcctcagg 8100 

aaacttgggg taccgccctt gcgagtctgg agacatcggg ccagaagtgt ccgcgctagg 8160 

ctactgtccc agggggggag ggctgccact tgtggcaagt acctcttcaa ctgggcagta 8220 

aggaccaagc tcaaactcac tccaatcccg. gctgcgtccc agttggattt atccagctgg 8280 

ttcgttgctg gttacagcgg gggagacata tatcacagcc tgtctcgtgc ccgaccccgc 8340 

tggttcatg- ggtgcctact cctacittct gtaggggtag gcatctarct actccccaac 8400 

cgatgaacgg ggagctaaac actccaggcc aataggccat cctgtttutt tccctttttt 8460 

tttttcttt- tttttttttt tttttttttt tttttttttt ctcctttttt tttcctcttt 8520 

ttttccttt- ctttcctttg gtggctccat cttagcccta gtcacggcta gctgtgaaag 8580 

gtccgtgagc cgcttgactg cagagagtgc tgatactggc ctctctgcag atcaagt 8637 
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<210> 3 
<211> 8001 
<212> DNA 

<213> Hepatitis C Virus 
<400> 3 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagrggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctartg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcarcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 12 60 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc . cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatggcgcc tattacggcc tactcccaac agacgcgagg cctacttggc 1860 
tgcatcatca ctagcctcac aggccgggac aggaaccagg tcgaggggga ggtccaagtg 1920 
gtctccaccg caacacaatc tttcctggcg acctgcgtca atggcgtgtg ttggactgtc 1980 
tatcatggtg ccggctcaaa gacccttgcc ggcccaaagg gcccaatcac ccaaatgtac. 2040 
accaatgtgg accaggacct cgtcggctgg caagcgcccc ccggggcgcg ttccttgaca 2100 
ccatgcacct gcggcagctc ggacctttac ttggtcacga ggcatgccga tgtcattccg 2160 
gtgcgccggc ggggcgacag cagggggagc ctactctccc ccaggcccgt ctcctacttg 2220 
aagggctctt cgggcggtcc actgctctgc ccctcggggc acgctgtggg catctttcgg 2280 
gctgccgtgt gcacccgagg ggttgcgaag" gcggtggact ttgtacccgt cgagtctatg 2340 
gaaaccacta tgcggtcccc ggtcttcacg gacaactcgt cccctccggc cgtaccgcag 2400 
acattccagg tggcccatct acacgcccct actggtagcg gcaagagcac taaggtgccg 24 60 
gctgcgtatg cagcccaagg gtataaggtg cttgtcctga acccgtccgt cgccgccacc 2520 
ctaggtttcg gggcgtatat gtctaaggca catggtatcg accctaacat cagaaccggg 2580 
gtaaggacca tcaccacggg tgcccccatc acgtactcca cctatggcaa gtttcttgcc 2640 
gacggtggtt gctctggggg cgcctatgac atcataatat gtgatgagtg ccactcaact 2700 
gactcgacca ctatcctggg catcggcaca gtcctggacc aagcggagac ggctggagcg 2760 
cgactcgtcg tgctcgccac cgctacgcct ccgggatcgg tcaccgtgcc acatccaaac 2820 
atcgaggagg tggctctgtc cagcactgga gaaatcccct tttatggcaa agccatcccc 2880 
atcgagacca tcaagggggg gaggcacctc attttctgcc attccaagaa gaaatgtgat 2940 
gagctcgccg cgaagctgtc cggcctcgga ctcaatgctg tagcatatta ccggggcctt 3000 
gatgtatccg tcataccaac tagcggagac gtcattgtcg tagcaacgga cgctctaatg 3060 
acgggcttta ccggcgattt cgactcagtg atcgactgca atacatgtgt cacccagaca 3120 
gtcgacttca gcctggaccc gaccttcacc attgagacga cgaccgtgcc acaagacgcg 3180 
gtgtcacgct cgcagcggcg aggcaggact ggtaggggca ggatgggcat ttacaggttt 3240 
gtgactccag gagaacggcc ctcgggcatg ttcgattcct cggttctgtg cgagtgctat 3300 
gacgcgggct gtgcttggta cgagctcacg cccgccgaga cctcagttag gttgcgggct 3360 
tacctaaaca caccagggtt gcccgtctgc caggaccatc tggagttctg ggagagcgtc 3420 
tttacaggcc tcacccacat agacgcccat ttcttgtccc agactaagca ggcaggagac 3480 
aacttcccct acctggtagc araccaggct acggtgtgcg ccagggctca ggctccacct 3540 
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ccatcgtggg accaaatgtg caagtgtctc atacggctaa agcctacgct gcacgggcca 3600 

acgcccctgc tgtataggct gggagccgtt caaaacgagg ttactaccac acaccccata 3660 

accaaataca tcatggcatg catgtcggct gacctggagg tcgtcacgag cacctgggtg 3720 

ctggtaggcg gagtcctagc agctctggcc gcgtattgcc tgacaacagg cagcgrggtc 3780 

attgtgggca ggatcatctt gtccggaaag ccggccatca ttcccgacag ggaagtcctt 3840 

taccgggagt tcgatgagat ggaagagtgc gcctcacacc tcccttacat cgaacaggga 3900 

atgcagctcg ccgaacaatt caaacagaag gcaatcgggt tgctgcaaac agccaccaag 3960 

caagcggagg ctgctgctcc cgtggtggaa tccaagtggc ggaccctcga agcctrctgg 4020 

gcgaagcata tgtggaattt catcagcggg atacaatatt tagcaggctt gtccactctg 4080 

cctggcaacc ccgcgatagc atcactgatg gcattcacag cctctatcac cagcccgctc 4140 

accacccaac ataccctcct gtttaacatc ctggggggat gggtggccgc ccaacrtgct 4200 

cctcccagcg ctgctrctgc tttcgtaggc gccggcatcg ctggagcggc tgttggcagc 4260 

ataggccttg ggaaggtgct tgtggatatt ttggcaggtt atggagcagg ggtggcaggc 4320 

gcgctcgtgg cctttaaggt catgagcggc gagatgccct ccaccgagga cctgg-taac 4380 

ctactccctg ctatcctctc ccctggcgcc ctagtcgtcg gggtcgtgtg cgcagcgata 4440 

ctgcgtcggc acgtgggccc agcggagggg gctgtgcagt ggatgaaccg cctgatagcg 4500 

ttcgcttcgc ggggtaacca cgtctccccc acgcactatg tgcctgagag cgacgctgca 4560 

gcacgtgtca ctcagatcct ctctagtctt accatcactc agctgctgaa gaggcttcac 4620 

cagtggatca acgaggacrg ctccacgcca tcctccggct cgtggctaag agatgtttgg 4 680 

gattggatat gcacggtgtt gactgatttc aagacctggc tccagtccaa gctcctgccg 4740 

cgattgccgg gagtcccctt cttctcatgt caacgtgggt acaagggagt ctggcggggc 4 800 

gacggcatca tgcaaaccac ctgcccatgt ggagcacaga tcaccggaca tgtgaaaaac 4 860 

ggttccatga ggatcgtggg gcctaggacc tgtagtaaca cgtggcatgg aacattcccc 4 920 

attaacgcgt acaccacggg cccctgcacg ccctccccgg cgccaaatta ttctagggcg 4980 

ctgtggcggg tggctgctga ggagtacgtg gaggttacgc gggtggggga tttccactac 5040 

gtgacgggca tgaccactga caacgtaaag tgcccgtgtc aggttccggc ccccgaattc 5100 

ttcacagaag tggatgggg- gcggttgcac aggtacgctc cagcgtgcaa acccctccta 5160 

cgggaggagg tcacattcct ggtcgggctc aatcaatacc tggttgggtc acagctccca 5220 

tgcgagcccg aaccggacgr agcagtgctc acttccatgc tcaccgaccc ctcccacatt 5280 

acggcggaga cggctaagcg taggctggcc. aggggatctc ccccctcctt ggccagctca 5340 

tcagctagcc agctgtctgc gccttcctzg aaggcaacat gcactacccg tcatgactcc 5400 

ccggacgctg acctcatcga ggccaaccrc ctgtggcggc aggagatggg cgggaacatc 5460 

acccgcgtgg agtcagaaaa taaggtagta attttggact ctttcgagcc gctccaagcg 5520 

gaggaggatg agagggaagt atccgttccg gcggagatcc tgcggaggtc caggaaattc 5580 

cctcgagcga tgcccatatg ggcacgcccg gattacaacc ctccactgtt agagtcctgg 5640 

aaggacccgg actacgtccc tccagtggta cacgggtgtc cattgccgcc tgccaaggcc 5700 

cctccgatac cacctccacg gaggaagagg acggttgtcc tgtcagaatc taccgtgtct 5760 

tctgccttgg cggagctcgc cacaaagacc ttcggcagct ccgaatcgtc ggccgtcgac 5820 

agcggcacgg caacggcctc tcctgaccag ccctccgacg acggcgacgc gggatccgac 5880 

gttgagtcgt actcctccat gccccccctt gagggggagc cgggggatcc cgatctcagc 5940 

gacgggtctt ggtctaccgt aagcgaggag gctagtgagg acgtcgtctg ctgctcgatg 6000 

tcctacacat ggacaggcgc cctgatcacg ccatgcgctg cggaggaaac caagctgccc 6060 

atcaatgcac tgagcaactc trtgctccgt caccacaact tggtctatgc tacaacatct. 6120 

cgcagcgcaa gcctgcggca gaagaaggtc acctttgaca gactgcaggt cctggacgac 6180 

cactaccggg acgtgctcaa ggagatgaag gcgaaggcgt ccacagttaa ggctaaactt 6240 

ctatccgtgg aggaagcctg taagctgacg cccccacatt cggccagatc taaatttggc 6300 

tatggggcaa aggacgtccg gaacctatcc'agcaaggccg ttaaccacat ccgctccgtg 6360 

tggaaggact tgctggaaga cactgagaca ccaattgaca ccaccatcat ggcaaaaaat 6420 

gaggttttct gcgtccaacc agagaagggg ggccgcaagc cagctcgcct tatcgtattc 6480 

ccagatttgg gggttcgtgt gtgcgagaaa atggcccttt acgatgtggt ctccaccctc 6540 

cctcaggccg tgatgggctc ttcatacgga ttccaatact ctcctggaca gcgggtcgag 6600 

ttcctggtga atgcctggaa agcgaagaaa tgccctatgg gcttcgcata tgacacccgc 6660 

tgttttgact caacggtcac tgagaatgac atccgtgttg aggagtcaat ctaccaatgt 6720 

tgtgacttgg cccccgaagc cagacaggcc ataaggtcgc tcacagagcg gctttacatc 6780 

gggggccccc tgactaattc taaagggcag aactgcggct atcgccggtg ccgcgcgagc 6840 

ggtgtactga cgaccagctg cggtaatacc ctcacatgtt acttgaaggc cgctgcggcc 6900 

tgtcgagctg cgaagctcca ggactgcacg atgctcgtat gcggagacga ccttgtcgtt 6960 

atctgtgaaa gcgcggggac ccaagaggac gaggcgagcc tacgggcctt cacggaggct 7 020 

atgactagat actctgcccc ccctggggac ccgcccaaac cagaatacga cttggagttg 7080 

ataacatcat gctcctccaa tgtgtcagtc gcgcacgatg catctggcaa aagggtgtac 7140 
tatctcaccc gtgaccccac cacccccctt gcgcgggctg cgtgggagac agctagacac 7200 

actccagtca attcctggct aggcaacatc atcatgtatg cgcccacctt gtgggcaagg 7260 

atgatcctga tgactcattt cttctccatc cttctagctc aggaacaact tgaaaaagcc 7320 

ctagattgtc agatctacgg ggcctgttac tccattgagc cacttgacct acctcagatc 7380 
attcaacgac tccatggcct tagcgcattt tcactccata gttactctcc aggtgagatc 7440 

aatagggtgg cttcatgcct caggaaactt ggggtaccgc ccttgcgagt ctggagacat 7500 
cgggccagaa gtgtccgcgc taggctactg tcccaggggg ggagggctgc cacttgtggc 7560 
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aagtacctct tcaactgggc 
tcccagttgg atttatccag 
agcctgtctc gtgcccgacc 
gtaggcatct atctactccc 
ccatcctgtt tttttccctt 
ttttctcctt tttttttcct 
cctagtcacg gctagctgtg 
tggcctctct gcagatcaag 



agtaaggacc aagctcaaac 
ctggttcgtt. gctggttaca 
ccgctggttc atgtggtgcc 
caaccgatga acggggagct 
tttttttttc tttttttttt 
ctttttttcc ttttctttcc 
aaaggtccgt gagccgcttg 
t 



tcactccaat cccggctgcg 7 620 
gcgggggaga catatatcac 7680 
tactcctact ttctgtaggg 7740 
aaacactcca ggccaatagg 7800 
tttttttttt tttttttttt 7860 
tttggtggct ccatcttagc 7920 
actgcagaga gtgctgatac 7980 

8001 
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<210>- 4 
<211> 7989 
<212> DNA 

<213> Hepatitis C Virus 



<40G> 4 

gccagccccc 

tcttcacgca 

cccccctccc 

gacgaccggg 

gcgagactgc 

gtgcctgcga 

ctcaaagaaa 

cggccgcttg 

ctgatgccgc 

acctgtccgg 

cgacgggcgt 

tgctattggg 

aagtatccat 

cattcgacca 

ttgtcgatca 

ccaggctcaa 

gcttgccgaa 

tgggtgtggc 

ttggcggcga 

agcgcatcgc 

gtttccctct 

cgaagccgct 

ccgtcttttg 

aggggtcttt 

gttcctctgg 

aaccccccac 

gcaaaggcgg 

tggctctcct 

atgggatctg 

aacgtctagg 

atggcgccta 

agcctcacag 

acacaatctt 

ggctcaaaga 

caggacctcg 

ggcagctcgg 

ggcgacagca 

ggcggtccac 

acccgagggg 

cggtccccgg 

gcccatctac 

gcccaagggt 

gcgtatatgt 

accacgggtg 

tctgggggcg 

atcctgggca 

ctcgccaccg 

gctctgtcca 

aaggggggga 

aagctgtccg 

ataccaacta 

ggcgatttcg 

ctggacccga 

cagcggcgag 

gaacggccct 

gcttggtacg 

ccagggttgc 

acccacatag 

ctggtagcat 

caaatgtgga 



cattgggggc 
caaagcgtct 
cggagagcca 
tcctttcttg 
tagccgagta 
grgccccggg 
aaccaaaggg 
cgtggagagg 
cgtgttccgg 
tgccctgaat 
tccttgcgca 
cgaagtgccg 
catggctgat 
ccaagcgaaa 
cgatgatctg 
ggcgcgcatg 
tatcatggtg 
ggaccgctat 
atgggctgac 
cttctatcgc 
agcgggatca 
tggaataagg 
gcaatgtgag 
cccctctcgc 
aagcttcttg 
ctggcgacag 
cacaacccca 
caagcgtatt 
atctggggcc 
ccccccgaac 
ttacggccta 
gccgggacag 
tcctggcgac 
cccttgccgg 
tcggctggca 
acctttactt 
gggggagcct 
tgctctgccc 
ttgcgaaggc 
tcttcacgga 
acgcccctac 
ataaggtgct 
ctaaggcaca 
cccccatcac 
cctatgacat 
tcggcacagt 
ctacgcctcc 
gcactggaga 
ggcacctcat 
gcctcggact 
gcggagacgt 
actcagtgat 
ccttcaccat 
gcaggactgg 
cgggcatgtt 
agctcacgcc 
ccgtctgcca 
acgcccattt 
accaggctac 
agtgtctcat 



gacactccac 
agccatggcg 
tagrggtctg 
gatcaacccg 
gtgttgggtc 
aggtctcgta 
cgcgccatga 
ctartcggct 
ctgtcagcgc 
gaactgcagg 
gctgtgctcg 
gggcaggatc 
gcaatgcggc 
catcgcatcg 
gacgaagagc 
cccgacggcg 
gaaaatggcc 
caggacatag 
cgcttcctcg 
cttcttgacg 
attccgcccc 
ccggtgtgcg 
ggcccggaaa 
caaaggaatg 
aagacaaaca 
gtgcctctgc 
gtgccacgtt 
caacaagggg 
tcggtgcaca 
cacggggacg 
ctcccaacag 
gaaccaggtc 
ctgcgtcaat 
cccaaagggc 
agcgcccccc 
ggtcacgagg 
actctccccc 
ctcggggcac 
ggtggacttt 
caactcgtcc' 
tggtagcggc 
tgtcctgaac 
tggtatcgac 
gtactccacc 
cataatatgt 
cctggaccaa 
gggatcggtc 
a at cccct tt 
tttctgccat 
caatgctgta 
cattgtcgta 
cgactgcaat 
tgagacgacg 
taggggcagg 
cgattcctcg 
cgccgagacc 
ggaccatctg 
cttgtcccag 
ggtgtgcgcc 
acggctaaag 



catagatcac 
ttagtatgag 
cggaaccggt 
ctcaatgcct 
gcgaaaggcc 
gaccgtgcac 
ttgaacaaga 
atgactgggc 
aggggcgccc 
acgaggcagc 
acgttgtcac 
tcctgtcatc 
ggctgcatac 
agcgagcacg 
atcaggggct 
aggatctcgt 
gcttttctgg 
cgttggctac 
tgctttacgg 
agttcttctg 
tctccctccc 
tttgtctata 
cctggccctg 
caaggtctgt 
acgtctgtag 
ggccaaaagc 
gtgagttgga 
ctgaaggatg 
tgctttacat 
tggttttcct 
acgcgaggcc 
gagggggagg 
ggcgtgtgtt 
ccaatcaccc 
ggggcgcgtt 
catgccgatg 
aggcccgtct 
gctgtgggca 
gtacccgtcg 
'cctccggccg 
aagagcacta 
ccgtccgtcg 
cctaacatca 
tatggcaagt 
gatgagtgcc 
gcggagacgg 
accgtgccac 
tatggcaaag 
tccaagaaga 
gcatattacc 
gcaacggacg 
acatgtgtca 
accgtgccac 
atgggcattt 
gttctgtgcg 
tcagttaggt 
gagttctggg 
actaagcagg 
agggctcagg 
cctacgctgc 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
tggattgcac 
acaacagaca 
ggttcttttt 
gcggctatcg 
tgaagcggga 
tcaccttgct 
gcttgatccg 
tactcggatg 
cgcgccagcc 
cgtgacccat 
attcatcgac 
ccgtgatatt 
tatcgccgct 
agtttaaaca 
ccccccctaa 
tgttattttc 
tcttcttgac 
tgaatgtcgt 
cgaccctttg 
cacgtgtat a 
tagttgtgga 
cccagaaggt 
gtgtttagtc 
ttgaaaaaca 
tacttggctg 
tccaagtggt 
ggactgtcta 
aaatgtacac 
ccttgacacc 
tcattccggt 
cctacttgaa 
tctttcgggc 
agtctatgga 
taccgcagac 
aggtgccggc 
ccgccaccct 
gaaccggggt 
ttcttgccga 
actcaactga 
ctggagcgcg 
atccaaacat 
ccatccccat 
aatgtgatga 
ggggccttga 
ctctaatgac 
cccagacagt 
aagacgcggt 
acaggtttgt 
agtgctatga 
tgcgggctta 
agagcgtctt 
caggagacaa 
ctccacctcc 
acgggccaac 



ggaactactg 
cctccaggac 
gaattgccag 
gcgtgccccc 
gcctgatagg 
aatcctaaac 
gcaggttctc 
atcggctgct 
gtcaagaccg 
tggctggcca 
agggactggc 
cctgccgaga 
gctacctgcc 
gaagccggtc 
gaactgttcg 
ggcgatgcct 
tgtggccggc 
gctgaagagc 
cccgattcgc 
gaccacaacg 
cgttactggc 
caccatattg 
gagcattcct 
gaaggaagca 
caggcagcgg 
agatacacct 
aagagtcaaa 
accccattgt 
gaggttaaaa 
cgataatacc 
catcatcact 
ctccaccgca 
tcatggtgcc 
caatgtggac 
atgcacctgc 
gcgccggcgg 
gggctcttcg 
tgccgtgtgc 
aaccactatg 
attccaggtg 
tgcgtatgca 
aggtttcggg 
aaggaccatc 
cggtggttgc 
ctcgaccact 
actcgtcgtg 
cgaggaggtg 
cgagaccatc 
gctcgccgcg 
tgtatccgtc 
gggctttacc 
cgacttcagc 
gtcacgctcg 
gactccagga 
cgcgggctgt 
cctaaacaca 
tacaggcctc 
cttcccctac 
atcgtgggac 
gcccctgctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 



CA 02303526 2000-03-31 



tataggctgg 
atggcatgca 
gtcctagcag 
atcatcttgt 
gatgagatgg 
gaacaattca 
gctgctcccg 
tggaatttca 
gcgatagcat 
accctcctgt 
gcttctgctt 
aaggtgcttg 
tttaaggt ca 
atcctctccc 
gtgggcccag 
ggtaaccacg 
cagatcctct 
gaggactgct 
acggtgttga 
gtccccttct 
caaaccacct 
atcgtggggc 
accacgggcc 
gctgctgagg 
accactgaca 
gatggggtgc 
acattcctgg 
ccggacgtag 
gctaagcgta 
ctgtctgcgc 
ctcatcgagg 
tcagaaaata 
agggaagtat 
cccatatggg 
tacgtccctc 
cctccacgga 
gagctcgcca 
acggcctctc 
tcctccatgc 
tctaccgtaa 
acaggcgccc 
agcaactctt 
ctgcggcaga 
gtgctcaagg 
gaagcctgta 
gacgtccgga 
ctggaagaca 
gtccaaccag 
gttcgtgtgt 
atgggctctt 
gcctggaaag 
acggtcactg 
cccgaagcca 
actaattcta 
accagctgcg 
aagctccagg 
gcggggaccc 
tctgcccccc 
tcctccaatg 
gaccccacca 
tcctggctag 
actcatttct 
atctacgggg 
catggcctta 
tcatgcctca 
gtccgcgcta 
aactgggcag 



gagccgttca 
tgtcggctga 
ctctggccgc 
ccggaaagcc 
aagagtgcgc 
aacagaaggc 
tggtggaatc 
tcagcgggat 
cactgatggc 
ttaacatcct 
tcgtaggcgc 
tggatatttt 
tgagcggcga 
ctggcgccct 
gggagggggc 
tctcccccac 
ctagtcttac 
ccacgccatg 
ctgatttcaa 
tctcatgtca 
gcccatgtgg 
ctaggacctg 
cctgcacgcc 
agtacgtgga 
acgtaaagtg 
ggttgcacag 
rcgggctcaa 
cagtgctcac 
ggctggccag 
cttccttgaa 
ccaacctcct 
aggtagtaat 
ccgttccggc 
cacgcccgga 
cagtggtaca 
ggaagaggac 
caaagacctt 
ctgaccagcc 
ccccccttga 
gcgaggaggc 
tgatcacgcc 
tgctccgtca 
agaaggtcac 
agatgaaggc 
agctgacgcc 
acctatccag 
ctgagacacc 
agaagggggg 
gcgagaaaat 
catacggatt 
cgaagaaatg 
agaatgacat 
gacaggccat 
aagggcagaa 
gtaataccct 
actgcacgat 
aagaggacga 
ctggggaccc 
tgtcagtcgc 
ccccccttgc 
gcaacatcat 
tctccatcct 
cctgttactc 
gcgcattttc 
ggaaacttgg 
ggctactgtc 
taaggaccaa 



aaacgaggtt 
cctggaggtc 
grattgcctg 
ggccatcatt 
ctcacacctc 
aatcgggttg 
caagtggcgg 
acaatatt ta 
attcacagcc 

ggggggatgg 

cggcatcgct 
ggcaggttat 
gatgccctcc 
agtcgtcggg 
tgtgcagtgg 
gcactatgtg 
catcactcag 
ctccggctcg 
gacctggctc 
acgtgggtac 
agcacagatc 
tagtaacacg 
ctccccggcg 
ggttacgcgg 
cccgtgtcag 
gtacgctcca 
tcaatacc-g 
ttccatgctc 
gggatctccc 
ggcaacatgc 
gtggcggcag 
tttggactct 
ggagatcctg 
ttacaaccct 
cgggtgtcca 
ggttgtcctg 
cggcagctcc 
ctccgacgac 

gggggagccg 

tagtgaggac 

atgcgctgcg 

ccacaacttg 

ctttgacaga 

gaaggcgtcc 

cccacattcg 

caaggccgtt 

aattgacacc* 

ccgcaagcca 

ggccctttac 

ccaatactct 

ccctatgggc 

ccgtgttgag 

aaggtcgctc 

ctgcggctat 

cacatgttac 

gctcgtatgc 

ggcgagccta 

gcccaaacca 

gcacgatgca 

gcgggctgcg 

catgtatgcg 

tctagctcag 

cattgagcca 

actccatagt 

ggtaccgccc 

ccaggggggg 

gctcaaactc 



actaccacac 

gtcacgagca 

acaacaggca 

cccgacaggg 

ccttacatcg 

ctgcaaacag 

accctcgaag 

gcaggcttgt 

tctatcacca 

gtggccgccc 

ggagcggctg 

ggagcagggg 

accgaggacc 

gtcgtgtgcg 

atgaaccggc 

cctgagagcg 

ctgctgaaga 

tggctaagag 

cagtccaagc 

aagggagtct 

accggacatg 

tggcatggaa 

ccaaattatt 

gtgggggatt 

gttccggccc 

gcgtgcaaac 

gttgggtcac 

accgacccct 

ccctccttgg 

actacccgtc 

gagatgggcg 

ttcgagccgc 

cggaggtcca 

ccactgttag 

ttgccgcctg 

tcagaatcta 

gaatcgtcgg 

ggcgacgcgg 

ggggatcccg 

gtcgtctgct 

gaggaaacca 

gtctatgcta 

ctgcaggtcc 

acagttaagg 

gccagatcta 

aaccacatcc 

accatcatgg 

gctcgcctta 

gatgtggtct ' 

cctggacagc 

ttcgcatatg 

gagtcaatct 

acagagsggc 

cgccggtgcc 

ttgaaggccg 

ggagacgacc 

cgggccttca 

gaatacgact 

tctggcaaaa 

tgggagacag 

cccaccttgt 

gaacaacttg 

cttgacctac 

tactctccag 

ttgcgagtct 

agggctgcca 

actccaatcc 



accccataac 

cctgggtgcc 

gcgtggtcat 

aagtccttta 

aacagggaat 

ccaccaagca 

ccttctgggc 

ccactctgcc 

gcccgctcac 

aacttgctcc 

ttggcagcat 

tggcaggcgc 

tggttaacct 

cagcgatact 

tgatagcgtt 

acgctgcagc 

ggcttcacca 

atgtttggga 

tcctgccgcg 

ggcggggcga 

cgaaaaacgg 

cattccccat 

ctagggcgct 

rccactacgt 

ccgaattctt 

ccctcctacg 

agctcccatg 

cccacattac 

ccagctcatc 

atgactcccc 

ggaacatcac 

tccaagcgga 

ggaaattccc 

agtcctggaa 

ccaaggcccc 

ccgtgtcttc 

ccgtcgacag 

gatccgacgt 

atctcagcga 

gctcgatgtc 

agctgcccat 

caacatctcg 

tggacgacca 

ctaaacttct 

aatttggcta 

gctccgtgtg 

caaaaaatga 

tcgtattccc 

ccaccctccc 

gggtcgagtt 

acacccgctg 

accaatgttg 

tttacatcgg 

gcgcgagcgg 

ctgcggcctg 

ttgtcgttat 

cggaggctat 

tggagttgat 

gggtgtacta 

ctagacacac 

gggcaaggat 

aaaaagccct 

ctcagatcat 

gtgagatcaa 

ggagacatcg 

cttgtggcaa 

cggctgcgtc 



caaatacatc 

ggtaggcgga 

tgtgggcagg 

ccgggagttc 

gcagctcgcc 

agcggaggct 

gaagcatatg 

tggcaacccc 

cacccaacat 

tcccagcgct 

aggccttggg 

gctcgtggcc 

actccctgct 

gcgtcggcac 

cgcttcgcgg 

acgtgtcact 

gtggatcaac 

ttggatatgc 

attgccggga 

cggcatcatg 

ttccatgagg 

taacgcgtac 

gtggcgggtg 

gacgggcatg 

cacagaagtg 

ggaggaggtc 

cgagcccgaa 

ggcggagacg 

agctagccag 

ggacgctgac 

ccgcgtggag 

ggaggatgag 

tcgagcgatg 

ggacccggac 

tccgatacca 

tgccttggcg 

cggcacggca 

tgagtcgtac 

cgggtcttgg 

ctacacatgg 

caatgcactg 

cagcgcaagc 

ctaccgggac 

atccgtggag 

tggggcaaag 

gaaggacttg 

ggttttctgc 

agatttgggg 

tcaggccgtg 

cctggtgaat 

ttttgactca 

tgacttggcc 

gggccccctg 

tgtactgacg 

tcgagctgcg 

ctgtgaaagc 

gactagatac 

aacatcatgc 

tctcacccgt 

tccagtcaat 

gatcctgatg 

agattgtcag 

tcaacgactc 

tagggtggct 

ggccagaagt 

gtacctcttc 

ccagttggat 



3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4 620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 
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ttatccagct ggttcgttgc tggttacagc 

gcccgacccc gctggttcat gtggtgccta 

ctactcccca accgatgaac ggggagctaa 

tttccctttt tttttttctt tttttttttt 

tttttcctct ttttttcctt ttctttcctt 

tagctgtgaa aggtccgtga gccgcttgac 
agatcaagt 



gggggagaca tatatcacag cctgtctcgt 7 680 

ctcctacttt ctgtaggggt aggcatctat 7740 

acactccagg ccaataggcc atcctgtttt 7800 

tttttttttt tttttttttt ttctcctttt 7860 

tggtggctcc atcttagccc tagtcacggc 7920 

tgcagagagt gctgatactg gcctctctgc 7980 

7989 
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<210> 5 
<211> 8649 
<212> DNA 

<213> Hepatitis C Virus 



<400> 5 

gccagccccc 

tcttcacgca 

cccccctccc 

gacgaccggg 

gcgagactgc 

gtgcrtgcga 

ctcaaagaaa 

acgcaggttc 

csatcggctg 

ttgtcaagac 

cgtggctggc 

gaagggactg 

ctcctgccga 

cggctacctg 

tggaagccgg 

ccgaactgtt 

atggcgatgc 

actgtggccg 

ttgctgaaga 

ctcccgattc 

cagaccacaa 

aacgttactg 

tccaccatat 

acgagcattc 

gtgaaggaag 

tgcaggcagc 

taagatacac 

gaaagagtca 

gtaccccatt 

tcgaggttaa 

cacgataata 

ctgatactct 

ttacaatatt 

gttcgggggg 

tttaccatca 

ataaccaaag 

cggaaggttg 

ggtacgtacg 

gaccttgcgg 

tggggggcag 

agggggaggg 

ctcgcgccta 

agcctcacag 

acacaatctt 

ggctcaaaga 

caggacctcg 

ggcagctcgg 

ggcgacagca 

ggcggtccac 

acccgagggg 

cggtccccgg 

gcccatctac 

gcccaagggt 

gcgtatatgt 

accacgggtg 

tctgggggcg 

atcctgggca 

ctcgccaccg 

gctctgtcca 

aaggggggga 



gattgggggc 
gaaagcgtct 
gggagagcca 
tcctttcttg 
tagccgagta 
gtgccccggg 
aaccaaacgt 
tccggccgct 
ctctgatgcc 
cgacctgtcc 
cacgacgggc 
gcrgctattg 
gaaagtatcc 
cccattcgac 
tcttgtcgat 
cgccaggctc 
ctgcttgccg 
gctgggtgtg 
gcttggcggc 
gcagcgcatc 
cggtttccct 
gccgaagccg 
tgccgtcttt 
ctaggggtct 
cagttcctct 
ggaacccccc 
ctgcaaaggc 
aatggctctc 
gtatgggatc 
aaaacgtcta 
ccatggaccg 
tgaccttgtc 
ttatcaccag 
gccgcgatgc 
ccaaaatctt 
tgccgtactt 
ctgggggtca 
tttatgacca 
tggcagttga 
acaccgcggc 
agatacatct 
ttacggccta 
gccgggacag 
tcctggcgac 
cccttgccgg 
tcggctggca 
acctttactt 
gggggagcct 
tgctctgccc 
ttgcgaaggc 
tcttcacgga 
acgcccctac 
ataaggtgct 
ctaaggcaca 
cccccatcac 
cctatgacat 
tcggcacagt 
ctacgcctcc 
gcactggaga 
ggcacctcat 



gacactccac 
agccatggcg 
tagtggtctg 
gatcaacccg 
gtgttgggtc 
aggtctcgta 
aacaccaacg 
tgggtggaga 
gccgtgttcc 
ggrgccctga 
gttccttgcg 
ggcgaagtgc 
atcatggctg 
caccaagcga 
caggatgatc 
aaggcgcgca 
aatatcatgg 
gcggaccgct 
gaatgggctg 
gccttctatc 
ctagcgggat 
cttggaataa 
tggcaatgtg 
ttcccctctc 
ggaagcttct 
acctggcgac 
ggcacaaccc 
ctcaagcgta 
tgatctgggg 
ggccccccga 
ggagatggca 
accgcactat 
ggccgaggca 
cgtcatcctc 
gctcgccata 
cgtgcgcgca 
ttatgtccaa 
tctcacccca 
gcccgtcgtc 
gtgtggggac 
gggaccggca 
ctcccaacag 
gaaccaggtc 
ctgcgtcaat 
cccaaagggc 
agcgcccccc 
ggtcacgagg 
actctccccc 
ctcggggcac 
ggtggacttt 
caactcgtcc 
tggtagcggc 
tgtcctgaac 
tggtatcgac 
gtactccacc 
cataatatgt 
cctggaccaa 
gggatcggtc 
aatccccttt 
tttctgccat 



catagatcac 
ttagtatgag 
cggaaccggt 
ctcaatgcct 
gcgaaaggcc 
gaccgtgcac 
ggcgcgccat 
ggctattcgg 
ggctgtcagc 
atgaactgca 
cagctgtgct 
cggggcagga 
atgcaatgcg 
aacatcgcat 
tggacgaaga 
tgcccgacgg 
tggaaaatgg 
atcaggacat 
accgcttcct 
gccttcttga 
caattccgcc 
ggccggtgtg 
agggcccgga 
gccaaaggaa 
tgaagacaaa 
aggtgcctct 
cagtgccacg 
ttcaacaagg 
cctcggtgca 
accacgggga 
gcatcgtgcg 
aagctgttcc 
cacttgcaag 
ctcacgtgcg 
ctcggtccac 
cacgggctca 
atggctctca 
ctgcgggact 
ttctctgata 
'atcatcttgg 
gacagccttg 
acgcgaggcc 
gagggggagg 
ggcgtgtgtt 
ccaatcaccc 
ggggcgcgtt 
catgccgatg 
aggcccgtct 
gctgtgggca 
gtacccgtcg 
cctccggccg 
aagagcacta 
ccgtccgtcg 
cctaacatca 
tatggcaagt 
gatgagtgcc 
gcggagacgg 
accgtgccac 
tatggcaaag 
tccaagaaga 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
gattgascaa 
ctatgactgg 
gcaggggcgc 
ggacgaggca 
cgacgttgtc 
tctcctgtca 
gcggctgcar 
cgagcgagca 
gcatcagggg 
cgaggatctc 
ccgcttttct 
agcgttggct 
cgtgctttac 
cgagttcttc 
cctctccctc 
cgtttgtcta 
aacctggccc 
tgcaaggtct 
caacgtctgt 
gcggccaaaa 
ttgtgagttg 
ggctgaagga 
catgctttac 
cgtggttttc 
gaggcgcggt 
tcgctaggct 
tgtggatccc 
cgatccaccc 
tcatggtgct 
ttcgtgcatg 
tgaagttggc 
gggcccacgc 
tggagaccaa 
gcctgcccgt 
aagggcaggg 
tacttggctg 
tccaagtggt 
ggactgtcta 
aaatgtacac 
ccttgacacc 
tcattccggt 
cctacttgaa 
tctttcgggc 
agtctatgga 
taccgcagac 
aggtgccggc 
ccgccaccct 
gaaccggggt 
ttcttgccga 
actcaactga 
ctggagcgcg 
atccaaacat 
ccatccccat 
aatgtgatga 



ggaactactg 
cctccaggac 
gaattgccag 
gcgtgccccc 
gcctgatagg 
aatcctaaac 
gatggattgc 
gcacaacaga 
ccggttcttt 
gcgcggc~at 
actgaagcgg 
tctcaccttg 
acgcttgatc 
cgtactcgga 
ctcgcgccag 
gtcgtgaccc 
ggattcatcg 
acccgtgata 
ggtatcgccg 
tgagtttaaa 
ccccccccct 
tatgttattt 
tgtcttcttg 
gttgaatgtc 
agcgaccctt 
gccacgtgta 
gatagttgtg 
tgcccagaag 
atgtgtttag 
ctttgaaaaa 
tttcgtaggt 
catatggtgg 
ccccctcaac 
agagctaatc 
ccaggctggt 
catgctggtg 
cgcactgaca 
gggcctacga 
ggttatcacc 
ctccgcccgc 
gtggcgactc 
catcatcact 
ctccaccgca 
tcatggtgcc 
caatgtggac 
atgcacctgc 
gcgccggcgg 
gggctcttcg 
tgccgtgtgc 
aaccactatg 
attccaggtg 
tgcgtatgca 
aggtttcggg 
aaggaccatc 
cggtggttgc 
ctcgaccact 
actcgtcgtg 
cgaggaggtg 
cgagaccatc 
gctcgccgcg 



60 

120 

XBO 

240 

300 

360 

420 

480 

540 

600 

660 

720 

7B0 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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aagctgtccg gcctcggact caatgctgta 
ataccaacta gcggagacgt cattgtcgta 
ggcgatttcg actcagtgat cgactgcaat 
ctggacccga ccttcaccat tgagacgacg 
cagcggcgag gcaggactgg taggggcagg 
gaacggccct cgggcatgtt cgaitcctcg 
gcttggtacg agctcacgcc cgccgagacc 
ccagggttgc ccgtctgcca ggaccatctg 
acccacatag acgcccattt cttgtcccag 
ctggtagcat accaggctac ggtgtgcgcc 
caaatgtgga agtgtctcat acggctaaag 
tataggctgg gagccgttca aaacgaggtt 
atggcatgca tgtcggctga cctggaggtc 
gtcctagcag ctctggccgc gtattgcctg 
atcatcttgt ccggaaagcc ggccatcatt 
gatgagatgg aagagtgcgc ctcacacctc 
gaacaattca aacagaaggc aatcgggttg 
gctgctcccg tggtggaatc caagtggcgg 
tggaatttca tcagcgggat acaatattta 
gcgatagcat cactgatggc attcacagcc 
accctcctgt ttaacatcct ggggggatgg 
gcttctgctt tcgtaggcgc cggcatcgct 
aaggtgcttg tggatatttt ggcaggttat 
tttaaggtca tgagcggcga gatgccctcc 
atcctctccc ctggcgccct agtcgtcggg 
gtgggcccag gggagggggc tgtgcagtgg 
ggtaaccacg tctcccccac gcactatgtg 
cagatcctct ctagtcttac catcactcag 
gaggactgct ccacgccatg ctccggctcg 
acggtgttga ctgatttcaa gacctggctc 
gtccccttct tctcatgtca acgtgggtac 
caaaccacct gcccatgtgg agcacagatc 
atcgtggggc ctaggacctg tagtaacacg 
accacgggcc cctgcacgcc ctccccggcg 
gctgctgagg agtacgtgga ggtzacgcgg 
accactgaca acgtaaagtg cccgtgtcag 
gatggggtgc ggttgcacag gtacgctcca 
acattcctgg tcgggctcaa tcaatacctg 
ccggacgtag cagtgctcac ttccatgctc 
gctaagcgta ggctggccag gggatctccc 
ctgtctgcgc cttccttgaa ggcaacatgc 
ctcatcgagg ccaacctcct gtggcggcag 
tcagaaaata aggtagtaat tttggactct 
agggaagtat ccgttccggc ggagatcctg 
cccatatggg cacgcccgga ttacaaccct 
tacgtccctc cagtggtaca cgggtgtcca 
cctccacgga ggaagaggac ggttgtcctg 
gagctcgcca caaagacctt cggcagctcc 
acggcctctc ctgaccagcc ctccgacgac 
tcctccatgc ccccccttga gggggagccg 
tctaccgtaa gcgaggaggc tagtgaggac 
acaggcgccc tgatcacgcc atgcgctgcg 
agcaactctt tgctccgtca ccacaacttg 
ctgcggcaga agaaggtcac ctttgacaga 
gtgctcaagg agatgaaggc gaaggcgtcc 
gaagcctgta agctgacgcc cccacattcg 
gacgtccgga acctatccag caaggccgtt 
ctggaagaca ctgagacacc aattgacacc 
gtccaaccag agaagggggg ccgcaagcca 
gttcgtgtgt gcgagaaaat ggccctttac 
atgggctctt catacggatt ccaatactct 
gcctggaaag cgaagaaatg ccctatgggc 
acggtcactg agaatgacat ccgtgttgag 
cccgaagcca gacaggccat aaggtcgctc 
actaattcta aagggcagaa ctgcggctat 
accagctgcg gtaataccct cacatgttac 
aagctccagg actgcacgat gctcgtatgc 



gcatattacc ggggccttga tgtatccgtc 3660 
gcaacggacg ctctaatgac gggctttacc 3720 
acatgtgtca cccagacagt cgacttcagc 3780 
accgtgccac aagacgcggt gtcacgctcg 3840 
atgggcattt acaggtttgt gactccagga 3900 
gttctgtgcg agtgctatga cgcgggctgt 3960 
tcagttaggt tgcgggctta cctaaacaca 4020 
gagttctggg agagcgtctt tacaggcctc 4080 
actaagcagg caggagacaa cttcccctac 4140 
agggctcagg ctccacctcc atcgtgggac 4200 
cctacgctgc acgggccaac gcccctgctg 4260 
actaccacac accccataac caaatacatc 4320 
gtcacgagca cctgggtgct ggtaggcgga 4380 
acaacaggca gcgtggtcat tgtgggcagg 4440 
cccgacaggg aagtccttta ccgggagttc 4500 
ccttacatcg aacagggaat gcagctcgcc 4560 
ctgcaaacag ccaccaagca agcggaggct 4620 
accctcgaag ccttctgggc gaagcatatg 4 680 
gcaggcttgt- ccactctgcc tggcaacccc 4740 
tctatcacca gcccgctcac cacccaacat 4800 
gtggccgccc aacttgctcc tcccagcgct 4860 
ggagcggctg ttggcagcat aggccttggg 4 920 
ggagcagggg tggcaggcgc gctcgtggcc 4 980 
accgaggacc tggttaacct actccctgct 5040 
gtcgtgtgcg cagcgatact gcgtcggcac 5100 
atgaaccggc tgatagcgtt cgcttcgcgg 5160 
cctgagagcg acgctgcagc acgtgtcact 5220 
ctgctgaaga ggcttcacca gtggatcaac 5280 
tggctaagag atgtttggga ttggatatgc 5340 
cagtccaagc tcctgccgcg attgccggga 5400 
aagggagtct ggcggggcga cggcatcatg 5460 
accggacatg tgaaaaacgg ttccatgagg 5520 
tggcatggaa cattccccat taacgcgtac 5580 
ccaaattatt ctagggcgct gtggcgggtg 5640 
gtgggggatt tccactacgt gacgggcatg 5700 
gttccggccc ccgaattctt cacagaagtg 5760 
gcgtgcaaac ccctcctacg ggaggaggtc 5820 
gttgggtcac agctcccatg cgagcccgaa 5880 
accgacccct cccacattac ggcggagacg 5940 
ccctccttgg ccagctcatc agctagccag 6000 
actacccgtc atgactcccc ggacgctgac 6060 
gagatgggcg ggaacatcac ccgcgtggag 6120 
ttcgagccgc tccaagcgga ggaggatgag 6180 
cggaggtcca ggaaattccc tcgagcgatg 6240 
ccactgttag agtcctggaa ggacccggac 6300 
ttgccgcctg ccaaggcccc tccgatacca 6360 
tcagaatcta ccgtgtcttc tgccttggcg 6420 
gaatcgtcgg ccgtcgacag cggcacggca 6430 
ggcgacgcgg gatccgacgt tgagtcgtac 6540 
ggggatcccg atctcagcga cgggtcttgg 6600 
gtcgtctgct gctcgatgtc ctacacatgg 6660 
gaggaaacca agctgcccat caatgcactg 6720. 
gtctatgcta caacatctcg cagcgcaagc 6780 
ctgcaggtcc tggacgacca ctaccgggac 6840 
acagttaagg ctaaacttct atccgtggag 6900 
gccagatcta aatttggcta tggggcaaag 6960 
aaccacatcc gctccgtgtg gaaggacttg 7020 
accatcatgg caaaaaatga ggttttctgc 7080 
gctcgcctta tcgtattccc agatttgggg 7140 
gatgtggtct ccaccctccc tcaggccgtg 7200 
cctggacagc gggtcgagtt cctggtgaat 7260 
ttcgcatatg acacccgctg ttttgactca 7320 
gagtcaatct accaatgttg tgacttggcc 7380 
acagagcggc tttacatcgg gggccccctg 7440 
cgccggtgcc gcgcgagcgg tgtactgacg 7 500 
ttgaaggccg ctgcggcctg tcgagctgcg 7560 
ggagacgacc ttgtcgttat ctgtgaaagc 7 620 
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gcggggaccc aagaggacga ggcgagccta 
tctgcccccc ctggggaccc gcccaaacca 
tcctccaatg tgtcagtcgc gcacgatgca 
gaccccacca ccccccttgc gcgggctgcg 
tcctggctag gcaacatcar catgtatgcg 
actcatttct tctccatcct tctagctcag 
atctacgggg cctgttactc cattgagcca 
catggcctta gcgcattttc actccatagt 
tcatgcctca ggaaacttgg ggtaccgccc 
gtccgcgcta ggctactgtc ccaggggggg 
aactgggcag taaggaccaa gctcaaactc 
ttatccagct ggttcgtrgc tggttacagc 
gcccgacccc gctggttcat gtggtgccta 
ctactcccca accgatgaac ggggagctaa 
tttccctttt tttttttctt tttttttttt 
tttttcctct ttttttcctt ttctttcctt 
tagctgtgaa aggtccgtga gccgcttgac 
agatcaagt 



cgggccttca cggaggctat gactagatac 7680 

gaatacgact tggagttgat aacatcatgc 7740 

tctggcaaaa gggtgtacta tctcacccgt 7800 

tgggagacag ctagacacac tccagtcaat 78 60 

cccaccttgt gggcaaggat gatcctgatg 7920 

gaacaacttg aaaaagccct agattgtcag 7980 

cttgacctac ctcagatcat tcaacgactc 8040 

tactctccag gtgagatcaa tagggtggct 8100 

ttgcgagtct ggagacatcg ggccagaagt. 8160 

agggctgcca cttgtggcaa gtacctcttc 8220 

actccaatcc cggctgcgtc ccagttggat 8280 

gggggagaca tatatcacag cctgtctcgt 8340 

ctcctacttt ctgtaggggt aggcatctat 8400 

acactccagg ccaataggcc atcctgtttt 8460 

tttttttttt tttttttttt ttctcctttt 8520 

tggtggctcc atcttagccc tagtcacggc 8580 

tgcagagagt gctgatactg gcctctctgc 8640 

8649 
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<210> 6 
<211> 8001 
<212> DNA . 

<213> Hepatitis C Virus 
<400> 6 

gccagccccc gattgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccctccc gggagagcca tagtggtctg 
gacgaccggg tcctttctrg gatcaacccg 
gcgagactgc tagccgagna gtgttgggtc 
grgcttgcga gtgccccggg aggtctcgta 
ctcaaagaaa aaccaaacgt aacaccaacg 
acgcaggttc tccggccgcr tgggtggaga 
caatcggctg ctctgatgcc gccgtgttcc 
ttgtcaagac cgacctgtcc ggtgccctga 
cgtggctggc cacgacgggc gttccttgcg 
gaagggactg gctgctattg ggcgaagtgc 
ctcctgccga gaaagtatcc atcatggctg 
cggctacctg cccattcgac caccaagcga 
tggaagccgg tcttgtcgat caggatgatc 
ccgaac.gtt cgccaggctc aaggcgcgca 
atggcgatgc ctgcttgccg aatatcatgg 
actgtggccg gctgggtgtg ccggaccgct 
ttgctgaaga gcttggcggc gaatgggctg 
ctcccgattc gcagcgcatc gccttctatc 
cagaccacaa cggtttccct ctagcgggat 
aacgttactg gccgaagccg cttggaataa 
tccaccatat tgccgtcttt tggcaatgtg 
acgagcattc ctaggggtct ttcccctctc 
gtgaaggaag cagttcctct ggaagcttct 
tgcaggcagc ggaacccccc acctggcgac 
taagatacac ctgcaaaggc ggcacaaccc 
gaaagagtca aatggctctc ctcaagcgta 
gtaccccatt gtatgggatc tgatctgggg 
tcgaggttaa aaaacgtcta ggccccccga 
cacgataata ccatggcgcc tattacggcc 
tgcatcatca ctagcctcac aggccgggac 
gtctccaccg caacacaatc tttcctggcg 
tatcatggtg ccggctcaaa gacccttgcc 
accaatgtgg accaggacct cgtcggctgg 
ccatgcacct gcggcagctc ggacctttac 
gtgcgccggc ggggcgacag cagggggagc 
aagggctctt cgggcggtcc actgctctgc 
gctgccgtgt gcacccgagg ggttgcgaag. 
gaaaccacta tgcggtcccc ggtcttcacg 
acattccagg tggcccatct acacgcccct 
gctgcgtatg cagcccaagg gtataaggtg 
ctaggtttcg gggcgtatat gtctaaggca 
gtagggacca- tcaccacggg tgcccccatc 
gacggtggtt gctctggggg cgcctatgac 
gactcgacca ctatcctggg catcggcaca 
cgactcgtcg tgctcgccac cgctacgcct 
atcgaggagg tggctctgtc cagcactgga 
atcgcgacca tcaagggggg gaggcacctc 
gagctcgccg cgaagctatc cggcctcgga 
gatgtatccg tcataccaac tagcggagac 
acgggcttta ccggcgattt cgactcagtg 
gtcgacttca gcctggaccc gaccttcacc 
gtgtcacgct cgcagcggcg aggcaggact 
gtgactccag gagaacggcc ctcgggcatg 
gacgcgggct gtgcttggta cgagctcacg 
tacctaaaca caccagggct gcccgtctgc 
tttacaggcc tcacccacat agacgcccat 
aacttcccct acctggtagc ataccaggct 
ccatcgtggg accaaatgtg ggagtgtctc 



catagatcac tcccctgtga ggaactactg 60 
ttagtatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccag 180 
ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 300 
caccgtgcac catgagcacg aatcctaaac 360 
ggcgcgccat gattgaacaa gatggattgc 420 
ggctattcgg ctatgactgg gcacaacaga 480 
ggctgrcagc gcaggggcgc ccggttcttt 540 
atgaac-gca ggacgaggca gcgcggctat 600 
cagctgtgct cgacgttgtc actgaagcgg 660 
cggggcagga tctcctgtca tctcaccttg 720 
atgcaatgcg gcggctgcat acgcttgatc 780 
aacatcgcat cgagcgagca cgtactcgga B40 
tggacgaaga gcatcagggg ctcgcgccag 900 
tgcccgacgg cgaggatctc gtcgtgaccc 960 
tggaaaatgg ccgcttttct ggattcatcg 1020 
atcaggacat agcgttggct acccgtgata 1080 
accgcttcct cgtgctttac ggtatcgccg 1140 
gccttcttga cgagttcttc tgagtttaaa 1200 
caattccgcc cctctccctc ccccccccct 1260 
ggccggtgtg cgtttgtcta tatgttattt 1320 
agggcccgga aacctggccc tgtcttcttg 1380 
gccaaaggaa tgcaaggtct gttgaatgtc 1440 
tgaagacaaa caacgtctgt agcgaccctt 1500 
aggtgcctct gcggccaaaa gccacgtgta 1560 
cagtgccacg ttgtgagttg gatagttgtg 1620 
ttcaacaagg ggctgaagga tgcccagaag 1680 
cctcggtgca catgctttac atgtgtttag 1740 
accacgggga cgtggttttc ctttgaaaaa 1800 
tactcccaac agacgcgagg cctacttggc 1860 
aggaaccagg tcgaggggga ggtccaagtg 1920 
acctgcgtca atggcgtgtg ttggactgtc 1980 
ggcccaaagg gcccaatcac ccaaatgtac 204 0 
caagcgcccc ccggggcgcg ttccttgaca 2100 
ttggtcacga ggcatgccga tgtcattccg 2160 
ctactctccc ccaggcccgt ctcctacttg 2220 
ccctcggggc atgctgtggg catctttcgg 2280 
,gcggtggact ttgtacccgt cgagtctatg 2340 
'gacaactcgt cccctccggc cgtaccgcag 2400 
actggtagcg gcaagagcac taaggtgccg 2460 
cttgtcctga acccgtccgt cgccgccacc 2520 
catggtatcg accctaacat cagaaccggg 2580 
acgtactcca cctatggcaa g.tttcttgcc 2 640 
atcataatat gtgatgagtg ccactcaact 2700 
gtcctggacc aagcggagac ggctggagcg 2760 
ccgggatcgg tcaccgtgcc acatccaaac 2820 
gaaatcccct tttatggcaa agccatcccc 2880 
attttctgcc attccaagaa gaaatgtgat 2940 
ctcaatgctg tagcatatta ccggggcctt 3000 
gtcattgtcg tagcaacgga cgctctaatg 3060 
atcgactgca atacatgtgt cacccagaca 3120 
attgagacga cgaccgtgcc acaagacgcg 3180 
ggtaggggca ggatgggcat ttacaggttt 3240 
ttcgattcct cggttctgtg cgagtgctat 3300 
cccgccgaga cctcagttag gttgcgggct 3360 
caggaccatc tggagttctg ggagagcgtc 3420 
ttcttgtccc agactaggca ggcaggagac 34 80 
acggtgtgcg ccagggctca ggctccaccr 354 0 
atacggctaa agcctacgct gcacgggcca 3600 
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acgcccctgc 
accaaataca 
ctggtaggcg 
attgtgggca 
taccgggagt 
atgcagctcg 
caagcggagg 
gcgaagcata 
cctggcaacc 
accacccaac 
cctcccagcg 
ataggccttg 
gcgctcgtgg 
ctactccctg 
ctgcgtcggc 
ttcgcttcgc 
gcacgtgtca 
cagtggatca 
gattggataz 
cgattgccgg 
gacggcatca 
ggttccatga 
attaacgcgt 
ctgtggcggg 
gtgacgggca 
ttcacagaag 
cgggaggagg 
tgcgggcccg 
acggcggaga 
tcagctagcc 
ccggacgctg 
acccgcgtgg 
gaggaggatg 
cctcgagcga 
aaggacccgg 
cctccgatac 
tctgccttgg 
agcggcacgg 
gttgagtcgt 
gacgggtctt 
tcctacacat 
gtcaatgcac 
cgcagcgcaa 
cactaccggg 
ctatccgtgg 
tatggggcaa 
tggaaggact 
gaggttttct 
ccagatttgg 
cctcaggccg 
ttcctggtga 
tgttttgact 
tgtgacttgg 
gggggccccc 
ggtgtactga 
tgtcgagctg 
atctgtgaaa 
atgactagat 
ataacatcat 
tatctcaccc 
actccagtca 
atgatcctga 
ctagattgtc 
attcaacgac 
aatagggtgg 
cgggccagaa 
aagtacctct 



tgtataggct 
tcatggcatg 
gagtcctagc 
ggatcatctt 
tcgatgagat 
ccgaacaatt 
ctgctgcrcc 
tgtggaai:tt 
ccgcgatagc 
ataccctcct 
ctgcttctgc 
ggaaggtgct 
cctttaaggt 
ctatcctctc 
acgtgggccc 
ggggtaacca 
ctcagatcct 
acgaggactg 
gcacggtgtt 
gagtcccctt 
tgca'aaccac 
ggatcgtggg 
acaccacggg 
tggctgctga 
tgaccactga 
tggatggggt 
tcacattcct 
aaccggacgt 
cggctaagcg 
agctgtctgc 
acctcatcga 
agtcagaaaa 
agagggaagt 
tgcccatatg 
actacgtccc 
cacctccacg 
cggagctcgc 
caacggcctc 
actcctccat 
ggtctaccgt 
ggacaggcgc 
tgagcaactc 
gcctgcggca 
acgtgctcaa 
aggaagcctg 
aggacgtccg 
tgctggaaga 
gcgtccaacc 
gggttcgtgt 
tgatgggctc 
atgcctggaa 
caacggtcac 
cccccgaagc 
rgactaattc 
cgaccagctg 
cgaagctcca 
gcgcggggac 
actctgcccc 
gctcctccaa 
gtgaccccac 
attcctggct 
tgactcattt 
agatctacgg 
tccatggcct 
cttcatgcct 
gtgtccgcgc 
tcaactgggc 



gggagccgtt 
catgtcggct 
agctctggcc 
gtccggaaag 
ggaagagtgc 
caaacagaag 
cgtggtggaa 
catcagcggg 
atcactgatg 
gtttaacatc 
tttcgtaggc 
tgtggatatt 
catgagcggc 
ccctggcgcc 
aggggagggg 
cgtctccccc 
ctctagtctt 
ctccacgcca 
gactgatttc 
cttctcatgt 
ctgcccatgt 
gcctaggacc 
cccctgcacg 
ggagtacgtg 
caacgtaaag 
gcggttgcac 
ggtcgggctc 
agcagtgctc 
taggctggcc 
gccttccttg 
ggccaacctc 
taaggtagta 
atccgttccg 
ggcacgcccg 
tccagtggta 
gagggagagg 
cacaaagacc 
tcctgaccag 
gccccccctt 
aagcgaggag 
cctgatcacg 
tttgctccgt 
gaagaaggtc 
ggagatgaag 
taagctgacg 
gaacctatcc 
cactgagaca 
agagaagggg 
gtgcgagaaa 
ttcatacgga 
agcgaagaaa 
tgagaatgac 
cagacaggcc 
taaagggcag 
cggtaatacc 
ggactgcacg 
ccaagaggac 
ccctggggac 
tgtgtcagtc 
cacccccctt 
aggcaacatc 
cttctccatc 
ggcctgttac 
tagcgcattt 
caggaaactt 
taggctactg 
agtaaggacc 



caaaacgagg 
gacctggagg 
gcgtattgcc 
ccggccatca 
gcctcacacc 
gcaatcgggt 
tccaagtggc 
atacaatatt 
gcattcacag 
ctggggggat 
gccggcatcg 
ttggcaggtt 
gagatgccct 
ctagtcgtcg 
gctgtgcagt 
acgcactatg 
accaicactc 
tgctccggct 
aagacctggc 
caacgtgggt 
ggagcacaga 
tgtagtaaca 
ccctccccgg 
gaggttacgc 
tgcccgtgtc 
aggtacgctc 
aatcaatacc 
acttccatgc 
aggggatctc 
aaggcaacat 
ctgtggcggc 
attttggact 
gcggagatcc 
gattacaacc 
cacgggtgtc 
acggttgtcc 
ttcggcagct 
ccctccgacg 
gagggggagc 
gctagtgagg 
ccatgcgctg 
caccacaact 
acctttgaca 
gcgaaggcgt 
cccccacatt 
agcaaggccg 
ccaattgaca 
ggccgcaagc 
atggcccttt 
ttccaatact 
tgccctatgg 
atccgtgttg 
ataaggtcgc 
aactgcggct 
ctcacatgtt 
atgctcgtat 
gaggcgagcc 
ccgcccaaac 
gcgcacgatg 
gcgcgggctg 
atcatgtatg 
cttctagctc 
tccattgagc 
tcactccata 
ggggtaccgc 
tcccaggggg 
aagctcaaac 



ttactaccac 
tcgtcacgag 
tgacaacagg 
ttcccgacag 
tcccttacat 
tgctgcaaac 
ggaccctcga 
tagcaggctt 
cctctatcac 
gggtggccgc 
ctggagcggc 
atggagcagg 
ccaccgagga 
gggtcgtgtg 
ggatgaaccg 
tgtctgagag 
agctgctgaa 
cgtggctaag 
tccagtccaa 
acaagggagt 
tcaccggaca 
cgtggcatgg 
cgccaaatta 
gggtggggga 
aggttccggc 
cagcgtgcaa 
tggttgggtc 
tcaccgaccc 
ccccctcctt 
gcactacccg 
aggagatggg 
ctttcgagcc 
tgcggaggtc 
ctccactgtt 
cattgccgcc 
tgtcagaatc 
ccgaatcgtc 
acggcgacgc 
cgggggatcc 
acgtcgtctg 
cggaggaaac 
tggtctatgc 
gactgcaggt 
ccacagttaa 
cggccagatc 
ttaaccacat 
ccaccatcat 
cagctcgcct 
acgatgtggt 
ctcctggaca 
gcttcgcata 
aggagtcaat 
tcacagagcg 
atcgccggtg 
atttgaaggc 
gcggagacga 
tacgggcctt 
cagaatacga 
catctggcaa 
cgtgggagac 
cgcccacctt 
aggaacaact 
cacttgacct 
gttactctcc 
ccttgcgagt 
ggagggctgc 
tcactccaat 



acaccccata 
cacctgggtg 
cagcgtggtc 
ggaagtcctt 
cgaacaggga 
agccaccaag 
agccttctgg 
gtccactctg 
cagcccgctc 
ccaacttgct 
tgttggcagc 
.ggtggcaggc 
cctggttaac 
cgcagcgata 
gctgatagcg 
cgacgctgca 
gaggcttcac 
agatgtttgg 
gctcctgccg 
ctggcggggc 
tgtgaaaaac 
aacattcccc 
ttctagggcg 
tttccactac 
ccccgaattc 
acccctccta 
acagctccca 
ctcccacatt 
ggccagctca 
tcatgactcc 
cgggaacatc 
gctccaagcg 
caggaaattc 
agagtcctgg 
tgccaaggcc 
taccgtgtct 
ggccgtcgac 
gggatccgac 
cgatctcagc 
ctgctcgatg 
caagctgccc 
tacaacatct 
cctggacgac 
ggctaaactt 
taaatttggc 
ccgctccgtg 
ggcaaaaaat 
tatcgtattc 
ctccaccctc 
gcgggtcgag 
tgacacccgc 
ctaccaatgt 
gctttacatc 
ccgcgcgagc 
cgctgcggcc 
ccttgtcgtt 
cacggaggct 
cttggagttg 
aagggtgtac 
agctagacac 
gtgggcaagg 
tgaaaaagcc 
acctcagatc 
aggtgagatc 
ctggagacat 
cacttgtggc 
cccggctgcg 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
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tcccagttgg 
agcctgtctc 
gtaggcatct 
ccatcctgtt 
ttttctcctt 
cctagtcacg 
tggcctctct 



atttatccag 
gtgcccgacc 
atctactccc 
tttttccctt 
tttttttcct 
gctagctgtg 
gcagatcaag 



ctggttcgtt 
ccgctggttc 
caaccgatga 
tttttttttc 
ctttttttcc 
aaaggtccgt 
t 



gctggttaca 
atgtggtgcc 
acggggagct 
tttttttttt 
ttttctttcc 
gagccgcttg 



gcgggggaga 
tactcctact 
aaacactcca 
tttttttttt 
tttggtggct 
actgcagaga 



catatatcac 
ttctgtaggg 
ggccaatagg 
tttttttttt 
ccatcttagc 
gtgctgatac 



7680 
7740 
7800 
7860 
7920 
7980 
8001 
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<210> 7 
<211> 11076 
<212> DNA 

<213> Hepatitis C Virus 



<400> 7 

gccagccccc 

tcttcacgca 

cccccctccc 

gacgaccggg 

gcgagactgc 

gtgcttgcga 

ctcaaagaaa 

acgcaggttc 

caatcggctg 

ttgtcaagac 

cgtggctggc 

gaagggactg 

ctcctgccga 

cggctacctg 

tggaagccgg 

ccgaactgtt 

atggcgatgc 

actgtggccg 

t tgctgaaga 

ctcccgattc 

cagaccacaa 

aacgttactg 

tccaccatat 

acgagcattc 

gtgaaggaag 

tgcaggcagc 

taagatacac 

gaaagagtca 

gtaccccatt 

tcgaggttaa 

cacgataata 

ccccgcccac 

ttgccgcgca 

caacctcgtg 

gctcagcccg 

ctcctgtcac 

cgcaatttgg 

attccgctcg 

gttctggagg 

ttccttttgg 

gtatccggag 

gcggacatga 

cgctgctggg 

acgatacgac 

gtgggagatc 

cggcacgaga 

cgtatggctt 

ttactccgga 

gcgggccttg 

ctctttgccg 

ctcgggatta 

aacggcagct 

ttccttgctg 

gccagctgca 

tcacacagct 

gtacccgcgg 

gggacgaccg 

ctgcttctta 

agcactgggt 

aaaaccttga 



gattgggggc 
gaaagcgtct 
gggagagcca 
tcctttcttg 
tagccgagta 
gtgccccggg 
aaccaaacgt 
tccggccgct 
ctctgatgcc 
cgacctgtcc 
cacgacgggc 
gctgctattg 
gaaagtaccc 
cccattcgac 
tcttgtcgat 
cgccaggctc 
ctgcttgccg 
gctgggtgtg 
gcttggcggc 
gcagcgcatc 
cggttrccct 
gccgaagccg 
tgccg-cttt 
ctaggggtct 
cagttcctct 
ggaacccccc 
ctgcaaaggc 
aatggctctc 
gtatgggatc 
aaaacgtcta 
ccatgggcac 
aggacgtcaa 
ggggccccag 
gaaggcgaca 
ggtacccctg 
cccgtggctc 
gtaaggtcat 
tcggcgcccc 
acggcgtgaa 
ctttgctgtc 
tgtaccatgt 
tcatgcatac 
tagcgctcac 
gccatgtcga 
tctgcggatc 
cagtacagga 
gggatatgat 
tcccacaagc 
cctactattc 
gcgttgacgg 
cgtccctctt 
ggcacatcaa 
cgctgttcta 
gccccatcga 
cggaccagag 
cgcaggtgtg 
accggttcgg 
acaacacgcg 
tcaccaagac 
cctgccccac 



gacactccac 
agccatggcg 
tagtggtctg 
gatcaacccg 
gtgttgggtc 
aggtctcgta 
aacaccaacg 
tgggtggaga 
gccgtgttcc 
ggtgccctga 
gttccttgcg 
ggcgaagtgc 
atcatggctg 
caccaagcga 
caggatgatc 
aaggcgcgca 
aatatcatgg 
gcggaccgct 
gaatgggctg 
gccttctatc 
ctagcgggat 
cttggaataa 
tggcaatgtg 
ttcccctctc 
ggaagcttct 
acctggcgac 
ggcacaaccc 
ctcaagcgta 
tgatctgggg 
ggccccccga 
gaatcctaaa 
gttcccgggc 
gttgggtgtg 
acctatcccc 
gcccctctat 
tcggcctagt 
cgataccctc 
cctagggggc 
ctatgcaaca 
ctgtttgacc 
cacgaacgac 
ccccgggtgc 
tcccacgctc 
tttgctcgtt 
tgttttcctc 
ctgcaattgc 
gatgaactgg 
tgtcgtggat 
catggtgggg 
gggaacctat 
ttcacccggg 
caggactgcc 
cgtgcacaag 
cgcgttcgct 
gccttattgt 
tggtccagtg 
cgtccctacg 
gccgccgcaa 
gtgcgggggc 
ggactgcttc 



catagatcac 
ttagtatgag 
cggaaccggt 
ctcaatgcct 
gcgaaaggcc 
gaccgtgcac 
ggcgcgccat 
ggctattcgg 
ggctgtcagc 
atgaactgca 
cagctgtgct 
cggggcagga 
atgcaatgcg 
aacatcgcat 
tggacgaaga 
tgcccgacgg 
tggaaaatgg 
atcaggacat 
accgcttcct 
gccttcttga 
caattccgcc 
ggccggtgtg 
agggcccgga 
gccaaaggaa 
tgaagacaaa 
aggtgcctct 
cagtgccacg 
ttcaacaagg 
cctcggtgca 
accacgggga 
cctcaaagaa 
ggtggtcaga 
cgcgcgacta 
aaggctcgcc 
ggcaatgagg 
tggggcccca 
acgtgcggct 
gctgccaggg 
gggaatctgc 
"atcccagctt 
tgctccaacg 
gtgccctgcg 
gcggccagga 
ggggcggctg 
gtcgcccagc 
tcaatatatc 
tcacctacag 
atggtggcgg 
aactgggcta 
gtgacagggg 
tcatcccaga 
ctgaactgca 
ttcaactcat 
caggggtggg 
tggcactacg 
tactgcttca 
tacagttggg 
ggcaactggt 
cccccgtgta 
cggaagcacc 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
gattgaacaa 
ctatgactgg 
gcaggggcgc 
ggacgaggca 
cgacgttgtc 
tctcctgtca 
gcggctgcat 
cgagcgagca 
gcatcagggg 
cgaggatctc 
ccgcttttct 
agcgttggct 
cgtgctttac 
cgagttcttc 
cctctccctc 
cgtttgtcta 
aacctggccc 
tgcaaggtct 
caacgtctgt 
gcggccaaaa 
ttgtgagttg 
ggctgaagga 
cargctttac 
cgt ggttttc 
aaaccaaacg 
tcgtcggtgg 
ggaagacttc 
agcccgaggg 
gcttggggtg 
cggacccccg 
tcgccgatct 
ccctggcgca 
ccggttgctc 
ccgcttatga 
caagcattgt 
ttcgggagaa 
acgctagcgt 
ctctctgctc 
tgttcacctt 
ccggccacgt 
cagccctagt 
gggcccattg 
aggttctgat 
ggacgatggc 
aaatccagct 
atgactccct 
ctggatgccc 
ggcccatcac 
caccccggcc 
ccccaagccc 
gggagaatga 
ttggctgtac 
acatcggggg 
ccgaggccac 



ggaactactg 
cctccaggac 
gaattgccag 
gcgtgccccc 
gcctgatagg 
aatcctaaac 
gatggattgc 
gcacaacaga 
ccggttcttt 
gcgcggctat 
actgaagcgg 
tctcaccttg 
acgcttgatc 
cgtactcgga 
ctcgcgccag 
gtcgtgaccc 
ggattcatcg 
acccgtgata 
ggtatcgccg 
tgagtttaaa 
ccccccccct 
tatgttattt 
tgtcttcttg 
gttgaatgtc 
agcgaccctr 
gccacgtgta 
gatagttgtg 
tgcccagaag 
atgtgtttag 
ctttgaaaaa 
taacaccaac 
agtttacctg 
cgagcggtcg 
tagggcctgg 
ggcaggatgg 
gcgtaggtcg 
catggggtac 
tggcgtccgg 
cttttctatc 
agtgcgcaac 
gtatgaggca 
caactcctcc 
ccccactacg 
cgctatgtac 
ctcgcctcgc 
gacaggtcac 
ggtatcgcag 
gggagtccta 
tgtgatgcta 
caaaaacacc 
tgtaaacacc 
caacactggg 
agagcgcatg 
ttacaatgag 
gtgcggtatc 
tgtcgtggtg 
gacggacgtg 
atggatgaat 
gatcggcaat 
ttacaccaag 



60 

120 

180 

240 

300 

360 

420- 

4S0 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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tgtggttcgg ggccttggtt gacacccaga tgcttggtcc actacccata caggctttgg 3660 

cactacccct gcactgtcaa ctttaccatc ttcaaggtta ggatgtacgt ggggggagtg 3720 

gagcacaggc tcgaagccgc atgcaattgg actcgaggag agcgttg-aa cctggaggac 3780 

agggacagat cagagcttag cccgctgctg ctgtctacaa cggagtggca ggtattgccc 3840 

tgttccttca ccaccctacc ggctctgtcc actggtttga tccatctcca tcagaacgtc 3900 

gtggacgtac aatacctgta cggtaraggg tcggcggttg tctcctttgc aatcaaatgg 3960 

gagtatgtcc tgttgctctt ccttcrtctg gcggacgcgc gcgtctgtgc ctgcttgtgg 4020 

atgatgctgc tgatagctca agctgaggcc cccctagaga acctggtggt cctcaacgcg 4080 

gcatccgtgg ccggggcgca tggcartctc tccttcctcg tgttcttctg tgctgcctgg 4140 

tacatcaagg gcaggctggt ccctggggcg gcatatgccc tctacggcgt atggccgcta 4200 

ctcctgctcc tgctggcgtt accaccacga gcatacgcca tggaccggga gatggcagca 4260 

tcgtgcggag gcgcggttcr cgtaggtctg aractcttga ccttgtcacc gcactataag 4320 

ctgttcctcg ctaggctcat atggtggtta caatatttta tcaccagggc cgaggcacac 4380 

ttgcaagtgt ggatcccccc cctcaacgtt cgggggggcc gcgatgccgt catcctcctc 4440 

acgtgcgcga tccacccaga gctaatcttt accatcacca aaatcttgct cgccatactc 4500 

ggtccactca tggtgctcca ggctggtata accaaagtgc cgtacttcgt gcgcgcacac 4560 

gggctcattc gtgcatgca- gctgg-gcgg aaggttgctg ggggtcatta tgtccaaatg 4620 

gctctcatga agttggccgc actgacaggt acgtacgttt' atgaccatct caccccactg 4680 

cgggactggg cccacgcggg cctacgagac cttgcggtgg cagttgagcc cgtcgtcttc 4740 

tctgatatgg agaccaaggt tatcacctgg ggggcagaca ccgcggcgtg tggggacatc 4800 

atcttgggcc tgcccgtctc cgcccgcagg gggagggaga tacatctggg accggcagac 4860 

agccttgaag ggcaggggtg gcgactcctc gcgcctatta cggcctactc ccaacagacg 4 920 

cgaggcctac ttggctgcat: catcactagc cicacaggcc gggacaggaa ccaggtcgag 4980 

ggggaggtcc aagtggtctc caccgcaaca caatctttcc tggcgacctg cgtcaatggc 5040 

gtgtgttgga ctgtctatca tggtgccggc tcaaagaccc ttgccggccc aaagggccca 5100 

atcacccaaa rgtacaccaa tgtggaccag gacctcgtcg gctggcaagc gccccccggg 5160 

gcgcgttcct tgacaccatg cacctgcggc agctcggacc tttacttggt cacgaggcat 5220 

gccgatgtca ttccggtgcg ccggcggggc gacagcaggg ggagcctact ctcccccagg 5280 

cccgtctcct acttgaaggg ctcttcgggc ggtccactgc tctgcccctc ggggcatgct 5340 

gtgggcatct ttcgggctgc cgtgtgcacc cgaggggttg cgaaggcggt ggactttgta 5400 

cccgtcgagt ctatggaaac cactatgcgg tccccggtct tcacggacaa ctcgtcccct 54 60 

ccggccgtac cgcagacatt ccaggtggcc catctacacg cccctactgg tagcggcaag 5520 

agcactaagg tgccggctgc gtatgcagcc caagggtata aggtgcttgt cctgaacccg 5580 

tccgtcgccg ccaccctagg tttcggggcg tatatgtcta aggcacatgg tatcgaccct 5640 

aacatcagaa ccggggtagg gaccatcacc acgggtgccc ccatcacgta ctccacctat 5700 

ggcaagtttc ttgccgacgg tggttgctct gggggcgccr atgacatcat aatatgtgat 5760 

gagtgccact caactgactc gaccactatc ctgggcatcg gcacagtcct ggaccaagcg 5820 

gagacggctg gagcgcgact cgtcgtgctc gccaccgcta cgcctccggg atcggtcacc 5880 

gtgccacatc caaacatcga ggaggtggct ctgtccagca ctggagaaat ccccttttat 5940 

ggcaaagcca tccccatcgc gaccatcaag ggggggaggc acctcatttt ctgccattcc 6000 

aagaagaaat gtgatgagct cgccgcgaag ctatccggcc tcggactcaa tgctgtagca 6060 

tattaccggg gccttgatgt atccgtcata ccaactagcg gagacgtcat tgtcgtagca 6120 

acggacgctc taatgacggg ctttaccggc gatttcgact cagtgatcga ctgcaataca 6180 

tgtgtcaccc agacagtcga cttcagcctg gacccgacct tcaccattga gacgacgacc 6240 

gtgccacaag acgcggtgtc acgctcgcag cggcgaggca ggactggtag gggcaggatg 6300 

ggcatttaca ggtttgtgac tccaggagaa cggccctcgg gcatgttcga ttcctcggtt 6360 

ctgtgcgagt gctatgacgc gggctgtgctf'tggtacgagc tcacgcccgc cgagacctca 6420 

gttaggttgc gggcttacct aaacacacca gggctgcccg tctgccagga ccatctggag 6480 

ttctgggaga gcgtctttac aggcctcacc cacatagacg cccatttctt gtcccagact 6540 

aggcaggcag gagacaactt cccctacctg gtagcatacc aggctacggt gtgcgccagg 6600 

gctcaggctc cacctccatc gtgggaccaa atgtgggagt gtctcatacg gctaaagcct 6660 

acgctgcacg ggccaacgcc cctgctgtat aggctgggag ccgttcaaaa cgaggttact 6720 

accacacacc ccataaccaa atacatcatg gcatgcatgt cggctgacct ggaggtcgtc 6780 

acgagcacct gggtgctggt aggcggagtc ctagcagctc tggccgcgta ttgcctgaca 6840 

acaggcagcg tggtcattgt gggcaggatc atcttgtccg gaaagccggc catcattccc 6900 

gacagggaag tcctttaccg ggagttcgat gagatggaag agtgcgcctc acacctccct 6960 

tacatcgaac agggaatgca gctcgccgaa caattcaaac agaaggcaat cgggttgctg 7020 

caaacagcca ccaagcaagc ggaggctgct gctcccgtgg tggaatccaa gtggcggacc 7080 

ctcgaagcct tctgggcgaa gcatatgtgg aatttcatca gcgggataca atatttagca 7140 

ggcttgtcca ctctgcctgg caaccccgcg atagcatcac tgatggcatt cacagcctct 7200 

atcaccagcc cgctcaccac ccaacatacc ctcctgttta acatcctggg gggatgggtg 7260 

gccgcccaac ttgctcctcc cagcgctgct tctgctttcg taggcgccgg catcgctgga 7320 

gcggctgttg gcagcatagg ccttgggaag gtgcttgtgg atattttggc aggttatgga 7380 

gcaggggtgg caggcgcgct cgtggccttt aaggtcatga gcggcgagat gccctccacc 7440 

gaggacctgg ttaacctact ccctgctatc ctctcccctg gcgccctagt cgtcggggtc 7500 

gtgtgcgcag cgatactgcg tcggcacgtg ggcccagggg agggggctgt gcagtggatg 7 5 60 

aaccggctga tagcgttcgc ttcgcggggt aaccacgtct cccccacgca ctatgtgtct 7620 
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gagagcgacg 
ctgaagaggc 
ctaagacatg 
tccaagctcc 
ggagtctggc 
ggacatgtga 
catggaacat 
aattattcta 
ggggatttcc 
ccggcccccq 
tgcaaacccc 
gggtcacagc 
gacccctccc 
tccttggcca 
acccgtcatg 
atgggcggga 
gagccgctcc 
aggtccagga 
ctgttagagt 
ccgcctgcca 
gaatctaccg 
tcgtcggccg 
gacgcgggat 
gatcccgatc 
gtctgctgct 
gaaaccaagc 
tatgctacaa 
caggtcctgg 
gttaaggcta 
agatctaaat 
cacatccgct 
atcatggcaa 
cgccttatcg 
gtggtctcca 
ggacagcggg 
gcatatgaca 
tcaatctacc 
gagcggcttt 
cggtgccgcg 
aaggccgctg 
gacgaccttg 
gccttcacgg 
tacgacttgg 
ggcaaaaggg 
gagacagcta 
accttgtggg 
caacttgaaa 
gacctacctc 
tctccaggtg 
cgagtctgga 
gctgccactt 
ccaatcccgg 
ggagacatat 
ctactttctg 
ctccaggcca 
tttttttttt 
tggctccatc 
agagagtgct 



ctgcagcacg 
ttcaccagtg 
tttgggattg 
tgccgcgatt 
ggggcgacgg 
aaaacggttc 
tccccattaa 
gggcgctgtg 
actacgtgac 
aattcttcac 
tcctacggga 
tcccatgcgg 
acattacggc 
gctcatcagc 
actccccgga 
acatcacccg 
aagcggagga 
aattccctcg 
cctggaagga 
aggcccct cc 
tgtcttctgc 
tcgacagcgg 
ccgacgttga 
tcagcgacgg 
cgatgtccta 
tgcccgtcaa 
catctcgcag 
acgaccacta 
aacttctatc 
ttggctargg 
ccgtgtggaa 
aaaatgaggt 
tattcccaga 
ccctccczca 
tcgagttcct 
cccgctgttt 
aatgttgrga 
acatcggggg 
cgagcggtgt 
cggcctgrcg 
tcgttatctg 
aggctatgac 
agttgataac 
tgtactatct 
gacacactcc 
caaggatgat 
aagccctaga 
agatcattca 
agatcaatag 
gacatcgggc 
gtggcaagta 
ctgcgtccca 
atcacagcct 
taggggtagg 
ataggccatc 
tttttttttc 
ttagccctag 
gatactggcc 



tgtcactcag 
gatcaacgag 
gatatgcacg 
gccgggagtc 
catcatgcaa 
catgaggatc 
cgcgtacacc 
gcgggtggct 
gggcatgacc 
agaagtggat 
ggaggtcaca 
gcccgaaccg 
ggagacggct 
tagccagctg 
cgctgacctc 
cgtggagtca 
ggatgagagg 
agcgatgccc 
cccggactac 
gataccacct 
cttggcggag 
cacggcaacg 
gtcgtactcc 
gtcttggtct 
cacat ggaca 
tgcactgagc 
cgcaagcctg 
ccgggacgtg 
cgtggaggaa 
ggcaaaggac 
ggacttgctg 
tttctgcgtc 
tttgggggtt 
ggccgtgatg 
ggtgaatgcc 
tgactcaacg 
cttggccccc 
ccccctgact 
actgacgacc 
agctgcgaag 
tgaaagcgcg 
tagatactct 
atcatgctcc 
cacccgtgac 
agtcaattcc 
cctgatgact 
ttgtcagatc 
acgactccat 
ggtggcttca 
cagaagtgtc 
cctcttcaac 
gttggattta 
gtctcgtgcc 
catctatcta 
ctgttttttt 
tccttttttt 
tcacggctag 
tctctgcaga 



atcctctcta 

gactgctcca 

gtgttgactg 

cccttcttct 

accacctgcc 

gtggggccta 

acgggcccct 

gctgaggagt 

actgacaacg 

ggggtgcggt 

ttcctggtcg 

gacgtagcag 

aagcgtaggc 

tctgcgcctt 

atcgaggcca 

gaaaataagg 

gaagtatccg 

atatgggcac 

gtccctccag 

ccacggaggg 

ctcgccacaa 

gcctctcctg 

tccatgcccc 

accgtaagcg 

ggcgccctga 

aactctttgc 

cggcagaaga 

ctcaaggaga 

gcctgtaagc 

gtccggaacc 

gaagacactg 

caaccagaga 

cgtgtgtgcg 

ggctcttcat 

tggaaagcga 

gtcactgaga 

gaagccagac 

aattctaaag 

agctgcggta 

ctccaggact 

gggacccaag 

gccccccctg 

tccaatgtgt 

cccaccaccc 

tggctaggca 

catttcttct 

tacggggcct 

ggccttagcg 

tgcctcagga 

cgcgctaggc 

tgggcagtaa 

tccagctggt 

cgaccccgct . 

ctccccaacc 

cccttttttt 

ttcctctttt 

ctgtgaaagg 

tcaagt 



gtcttaccat 
cgccatgctc 
atttcaagac 
catgtcaacg 
catgtggagc 
ggacctgtag 
gcacgccctc 
acgtggaggt 
taaagtgccc 
tgcacaggta 
ggctcaatca 
tgctcacttc 
tggccagggg 
ccttgaaggc 
acctcctgtg 
tagtaatttt 
ttccggcgga 
gcccggatta 
tggtacacgg 
agaggacggt 
agaccttcgg 
accagccctc 
cccttgaggg 
aggaggctag 
tcacgccatg 
tccgtcacca 
aggtcacctt 
tgaaggcgaa 
tgacgccccc 
tatccagcaa 
agacaccaat 
aggggggccg 
agaaaatggc 
acggattcca 
agaaatgccc 
atgacatccg 
aggccataag 
ggcagaactg 
ataccctcac 
gcacgatgct 
aggacgaggc 
gggacccgcc 
cagtcgcgca 
cccttgcgcg 
acatcatcat 
ccatccttct 
gttactccat 
cattttcact 
aacttggggt 
tactgtccca 
ggaccaagct 
tcgttgctgg 
ggttcatgtg 
gatgaacggg 
ttttcttttt 
tttccttttc 
tccgtgagcc 



cactcagctg 
cggctcgtgg 
ctggctccag 
tgggtacaag 
acagatcacc 
taacacgtgg 
cccggcgcca 
tacgcgggtg 
gtgtcaggtt 
cgctccagcg 
atacctggtt 
catgctcacc 
atctcccccc 
aacatgcact 
gcggcaggag 
ggact ctttc 
gatcctgcgg 
caaccctcca 
gtgtccattg 
tgtcctgtca 
cagctccgaa 
cgacgacggc 
ggagccgggg 
tgaggacgtc 
cgctgcggag 
caacttggtc 
tgacagactg 
ggcgtccaca 
acattcggcc 
ggccgttaac 
tgacaccacc 
caagccagct 
cctttacgat 
atactctcct 
tatgggcttc 
tgttgaggag 
gtcgctcaca 
cggctatcgc 
atgttatttg 
cgtatgcgga 
gagcctacgg 
caaaccagaa 
cgatgcatct 
ggctgcgtgg 
gtatgcgccc 
agctcaggaa 
tgagccactt 
ccatagttac 
accgcccttg 
gggggggagg 
caaactcact 
ttacagcggg 
gtgcctactc 
gagctaaaca 
tttttttttt 
tttcctttgg 
gcttgactgc 



7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

109B0 

11040 

11076 
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<210> 8 
<211> 8001 
<212> DNA 

<213> Hepatitis C Virus 



<400> 8 

gccagccccc 

tcttcacgca 

cccccctccc 

gacgaccggg 

gcgagactgc 

gtgcttgcga 

ctcaaagaaa 

acgcaggttc 

caatcggctg 

ttgtcaagac 

cgtggctggc 

gaagggactg 

ctcctgccga 

cggctacctg 

tggaagccgg 

ccgaactgtt 

atggcgatgc 

actgtggccg 

ttgctgaaga 

ctcccgat tc 

cagaccacaa 

aacgttactg 

tccaccatat 

acgagcat tc 

gtgaaggaag 

tgcaggcagc 

taagatacac 

gaaagagtca 

gtaccccatt 

tcgaggttaa 

cacgataata 

tgcatcatca 

gtctccaccg 

tatcatggtg 

accaatgtgg 

ccatgcacct 

gtgcgccggc 

aagggctctt 

gctgccgtgt 

gaaaccacta 

acattccagg 

gctgcgtatg 

ctaggtttcg 

gtaaggacca 

gacggtggtt 

gactcgacca 

cgactcgtcg 

atcgaggagg 

atcgagacca 

gagctcgccg 

gatgtatccg 

acgggcttta 

gtcgacttca 

gtgtcacgct 

gtgactccag 

gacgcgggct 

tacctaaaca 

tttacaggcc 

aacttcccct 

ccatcgtggg 



gattgggggc 
gaaagcgtct 
gggagagcca 
tcctttcttg 
tagccgagt a 
gtgccccggg 
aaccaaacgt 
tccggecgct 
ctctgatgcc 
cgacctgtcc 
cacgacgggc 
gctgctattg 
gaaagtatcc 
cccattcgac 
tcttgtcgat 
cgccaggctc 
ctgcttgccg 
gctgggtgtg 
gcttggcggc 
gcagcgcatc 
cggtttccct 
gccgaagccg 
tgccgtcttt 
ctaggggtct 
cagttcctct 
ggaacccccc 
ctgcaaaggc 
aatggctctc 
gtatgggatc 
aaaacgtcta 
ccatggcgcc 
ctagcctcac 
caacacaatc 
ccggctcaaa 
accaggacct 
gcggcagctc 
ggggcgacag 
cgggcggtcc 
gcacccgagg 
tgcggtcccc 
tggcccarct 
cagcccaagg 
gggcgtatat 
tcaccacggg 
gctctggggg 
ctatcctggg 
tgctcgccac 
tggctctgtc 
tcaagggggg 
cgaagctgtc 
tcataccaac 
ccggtgactt 
gcctggaccc 
cgcagcggcg 
gagaacggcc 
gtgcttggta 
caccagggtt 
tcacccacat 
acctggtagc 
accaaatgtg 



gacactccac 
agccatggcg 
tagtggtctg 
gatcaacccg 
gtgttgggtc 
aggtctcgta 
aacaccaacg 
tgggtggaga 
gccgtgttcc 
ggtgccctga 
gttccttgcg 
ggcgaagtgc 
atcatggcrg 
caccaagcga 
caggatgatc 
aaggcgcgca 
aatatcatgg 
gcggaccgct 
gaatgggctg 
gccttctatc 
ctagcgggat 
cttggaataa 
tggcaatgtg 
ttcccctctc 
ggaagcttct 
acctggcgac 
ggcacaaccc 
ctcaagcgta 
tgatctgggg 
ggccccccga 
tattacggcc 
aggccgggac 
tttcctggcg 
gacccttgcc 
cgtcggctgg 
ggacctttac 
cagggggagc 
actgctctgc 
ggttgcgaag 
ggtcttcacg 
acacgcccct 
gtataaggtg 
gtctaaggca 
tgcccccatc 
cgcctatgac 
catcggcaca 
cgctacgcct 
cagcactgga 
gaggcacctc 
cggcctcgga 
tagcggagac 
cgactcagtg 
gaccttcacc 
aggcaggact 
ctcgggcatg 
cgagctcacg 
gcccgtctgc 
agacgcccat 
ataccaggct 
gaagtgtctc 



catagatcac 
ttagtatgag 
cggaaccggt 
ctcaatgcct 
gcgaaaggcc 
gaccgtgcac 
ggcgcgccat 
ggctattcgg 
ggctgtcagc 
atgaactgca 
cagctgtgct 
cggggcagga 
atgcaatgcg 
aacatcgcat 
tggacgaaga 
tgcccgacgg 
tggaaaatgg 
atcaggacat 
accgcttcct 
gccttcttga 
caattccgcc 
ggccggtgtg 
agggcccgga 
gccaaaggaa 
tgaagacaaa 
aggtgcctct 
cagtgccacg 
t tcaacaagg 
cctcggtgca 
accacgggga 
tactcccaac 
aggaaccagg 
acctgcgtca 
ggcccaaagg 
caagcgcccc 
ttggtcacga 
ctactctccc 
ccctcggggc 
gcggtggact 
ga.caactcgt 
actggtagcg 
cttgtcctga 
catggtatcg 
acgtactcca 
atcataatat 
gtcctggacc 
ccgggatcgg 
gaaatcccct 
attttctgcc 
ctcaatgctg 
gtcattgtcg 
atcgactgca 
attgagacga 
ggtaggggca 
ttcgattcct 
cccgccgaga 
caggaccatc 
ttcttgtccc 
acggtgtgcg 
atacggctaa 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
gattgaacaa 
ctatgactgg 
gcaggggcgc 
ggacgaggca 
cgacgttgtc 
tctcctgtca 
gcggctgcat 
cgagcgagca 
gcatcagggg 
cgaggatctc 
ccgcttttct 
agcgttggct 
cgtgctttac 
cgagttcttc 
cctctccctc 
cgtttgtcta 
aacctggccc 
tgcaaggtct 
caacgtctgt 
gcggccaaaa 
ttgtgagttg 
ggctgaagga 
catgctttac 
cgtggttttc 
agacgcgagg 
tcgaggggga 
atggcgtgtg 
gcccaatcac 
ccggggcgcg 
ggcatgccga 
ccaggcccgt 
acgctgtggg 
ttgtacccgt 
cccctccggc 
gcaagagcac 
acccgtccgt 
accctaacat 
cctatggcaa 
gtgatgagtg 
aagcggagac 
tcaccgtgcc 
tttatggcaa 
attccaagaa 
tagcatatta 
tagcaacgga 
atacatgtgt 
cgaccgtgcc 
ggatgggcat 
cggttctgtg 
cctcagttag 
tggagttctg 
agactaagca 
ccagggctca 
agcctacgct 



ggaactactg 
cctccaggac 
gaattgccag 
gcgtgccccc 
gcctgatagg 
aatcctaaac 
gatggattgc 
gcacaacaga 
ccggttcttt 
gcgcggctat 
actgaagcgg 
tctcaccttg 
acgcttgatc 
cgtactcgga 
ctcgcgccag 
gtcgtgaccc 
ggattcatcg 
acccgtgata 
ggtatcgccg 
tgagtttaaa 
ccccccccct 
tatgttattt 
tgtcttcttg 
gttgaatgtc 
agcgaccctt 
gccacgtgta 
gatagttgtg 
tgcccagaag 
atgtgtttag 
ctttgaaaaa 
cctacttggc 
ggtccaagtg 
ttggactgtc 
ccaaatgtac 
ttccttgaca 
tgtcattccg 
ctcctacttg 
catctttcgg 
cgagtctatg 
cgtaccgcag 
taaggtgccg 
cgccgccacc 
cagaatcggg 
gtttcttgcc 
ccactcaact 
ggctggagcg 
acatccaaac 
agccatcccc 
gaaatgtgat 
ccggggcctt 
cgctctaatg 
cacccagaca 
acaagacgcg 
ttacaggttt 
cgagtgctat 
gttgcgggct 
ggagagcgtc 
ggcaggagac 
ggctccacct 
gcacgggcca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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acgcccctgc tgtataggct gggagccgtt caaaacgagg ttactaccac acaccccata 3660 
accaaataca tcatggcatg catgtcggct gacctggagg tcgtcacgag cacctgggtg 3720 
ctggtaggcg gagtcctagc agctctggcc gcgtattgcc tgacaacagg cagcgtggtc 3780 
attgtgggca ggatcatctt gtccggaaag ccggccatca ttcccgacag ggaagtcctt 3840 
taccgggagt tcgatgagat ggaagagtgc gcctcacacc tcccttacar cgaacaggga 3900 
atgcagctcg ccgaacaatt caaacagaag gcaatcgggt tgctgcaaac agccaccaag 3960 
caagcggagg ctgctgctcc cgrggtggaa tccaagtggc ggaccatcga agccttctgg 4020 
gcgaagcata tgtggaattt cai.cagcggg atacaatatt tagcaggctt gtccactctg 4080 
cctggcaacc ccgcgatagc atcactgatg gcattcacag cctctatcac cagcccgctc 4140 
accacccaac ataccctcct gtttaacatc ctggggggat gggtggccgc ccaacttgct 4200 
cctcccagcg ctgcttctgc tttcgtaggc gccggcatcg ctggagcggc tgttggcagc 4260 
ataggccttg ggaaggtgct tgtggatatt ttggcaggtt atggagcagg ggtggcaggc 4320 
gcgctcgtgg cctttaaggt catgagcggc gagatgccct ccaccgagga cctggttaac 4380 
ctactccctg ctatcctctc ccctggcgcc ctagtcgtcg gggtcgtgtg cgcagcgata 4440 
ctgcgtcggc acgtgggccc aggggagggg gctgtgcagt ggatgaaccg gctgatagcg 4500 
ttcgcttcgc ggggtaacca cgtctccccc acgcactatg tgcctgagag cgacgctgca 4560 
gcacgtgtca ctcagatcct ctctagtctt accatcactc agctgctgaa gaggcttcac 4620 
cagtggatca acgaggactg ctccacgcca tgctccggct cgtggctaag agatgtttgg 4680 
gattggatat gcacggtgtt gactgatttc aagacctggc tccagtccaa gctcctgccg 4740 
cgattgccgg gagtcccctt cttctcatgt caacgtgggt acaagggagt ctggcggggc 4800 
gacggcatca tgcaaaccac ctgcccatgt ggggcacaga tcaccggaca tgtgaaaaac 4860 
ggttccatga ggatcgtggg gcctaggacc tgtagtaaca cgtggcatgg aacattcccc 4920 
attaacgcgt acaccacggg cccctgcacg ccctccccgg cgccaaatta ttctagggcg 4 980 
ctgtggcggg tggctgctga ggagtacgtg gaggttacgc gggtggggga tttccactac 5040 
gtgacgggca tgaccactga cgacgtaaag tgcccgtgtc aggttccggc ccccgaattc 5100 
ttcacagaag tggatggggt gcggttgcac aggtacgctc cagcgtgcaa acccctccta 5160 
cgggaggagg tcacattcct ggtcgggctc aatcaatacc tggttgggtc acagctccca 5220 
tgcgagcccg aaccggatgt agcagtgctc acttccatgc tcaccgaccc ctcccacatt 5280 
acggcggaga cggctaagcg taggctggcc aggggatctc ctcccccctt ggccagctca 534 0 
tcagctagcc agctgtctgc gccttccttg aaggcaacat gcactacccg tcatgactcc 5400 
ccggacgctg acctcatcga ggccaacctc ctgtggcggc aggagatggg cgggaacatc 5460 
acccgcgtgg agtcagaaaa taaggtagta attttggact ctttcgagcc gctccaagcg 5520 
gaggaggatg agagggaagt atccgttccg gcggagatcc tgcggaggtc caggaaattc 5580 
cctcgagcga tgcccatatg ggcacgcccg gattacaacc ctccactgtt agagtcctgg 5640 
aaggacccgg actacgtccc tcca'gtggta cacgggtgtc cattgccgcc tgccaaggcc 5700 
cctccgatac caccttcacg gaggaagagg acggttgtcc tgtcagaatc taccgtgtct 5760 
tctgccttgg cggagctcgc cacagagacc ttcggcagct ccgaatcgtc ggccgtcgac 5820 
agcggcacgg caacggcctc tcctgaccag ccctccgacg acggcgacgc gggatccgac 5880 
gttgagtcgt actcctccat gccccccctt gagggggagc cgggggatcc cgatctcagc 5940 
gacgggtctt ggtctaccgt aagcgaggag gctagtgagg acgtcgtctg ctgctcgatg 6000 
tccracacat ggacaggcgc cctgatcacg ccatgcgctg cggaggaaac caagctgccc 6060 
atcaatgcac tgagcaactc tttgctccgt caccacaact tggtctatgc tacaacatct 6120 
cgcagcgcaa gcctgcggca gaagaaggtc acctttgaca gactgcaggt cctggacgac 6180 
cactaccggg acgtgctcaa ggagatgaag gcgaaggcgt ccacagttaa ggctaaactt 624 0 
ctatccgtgg aggaagcctg taagctgacg cccccacatt cggccagatc taaatttggc 6300 
tatggggcaa aggacgtccg gaacetatcc agcaaggccg ttaaccacat ccgctccgtg 6360 
tggaaggact tgctggaaga cactgagaca'ccaattgaca ccaccatcat ggcaaaaaat 6420 
gaggttttct gcgtccaacc agagaagggg ggccgcaagc cagctcgcct tatcgtattc 64 8 0 
ccagatttgg gggttcgtgt gtgcgagaaa atggcccttt acgatgtggt ctccaccctc 654 0 
cctcaggccg tgatgggctc ttcatacgga ttccaatact ctcctggaca gcgggtcgag 6600 
ttcctggtga atgcctggaa agcgaagaaa tgccctatgg gcttcgcata tgacacccgc 6660 
tgttttgact caacggtcac tgagaatgac atccgtgttg aggagtcaat ctaccaatgt 6720 
tgtgacttgg cccccgaagc cagacaggcc ataaggtcgc tcacagagcg gctttacatc 6780 
gggggccccc tgactaattc taaagggcag aactgcggct atcgccggtg ccgcgcgagc 684 0 
ggtgtactga cgaccagctg cggtaatacc ctcacatgtt acttgaaggc cgctgcggcc 6900 
tgtcgagctg cgaagctcca ggactgcacg atgctcgtat gcggagacga ccttgtcgtt 6960 
atctgtgaaa gcgcggggac ccaagaggac gaggcgagcc tacgggcctt cacggaggct 7020 
atgactagat actctgcccc ccctggggac ccgcccaaac cagaatacga cttggagttg 7080 
ataacatcat gctcctccaa tgtgtcagtc gcgcacgatg catctggcaa aagggtgtac 714 0 
tatctcaccc gtgaccccac cacccccctt gcgcgggctg cgtgggagac agctagacac 7200 
actccagtca attcctggct aggcaacatc atcatgtatg cgcccacctt gtgggcaagg 7260 
atgatcctga tgactcattt cttctccatc cttctagctc aggaacaact tgaaaaagcc 7320 
ctagattgtc agatctacgg ggcctgttac tccattgagc cacttgacct acctcagatc 7380 
attcaacgac tccatggcct tagcgcattt tcactccata gttactctcc aggtgagatc 7440 
aatagggtgg cttcat gcct caggaaactt ggggtaccgc ccttgcgagt ctggagacat 7500 
cgggccagaa gtgtccgcgc taggctactg tcccaggggg ggagggctgc cacttgtggc 7560 
aagtacctct tcaactgggc agtaaggacc aagctcaaac tcactccaat cccggctgcg 762 0 
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tcccagttgg atttatccag 
agcctgtctc gtgcccgacc 
gtaggcatct atctactccc 
ccatcctgtt tttttccctt 
ttttctcctt tttttttcct 
cctagtcacg gctagctgtg 
tggcctctct gcagatcaag 



ctggttcgtt gctggttaca 
ccgctggttc atgtggtgcc 
caaccgatga acggggagct 
tttttttttc tttttttttt 
ctttttttcc ttttctttcc 
aaaggtccgt gagccgcttg 
t 



gcgggggaga catatatcac 7680 
tactcctact ttctgtaggg 7740 
aaacactcca ggccaatagg 7800 
tttttttttt tttttttttt 7860 
tttggtggct ccatcttagc 7920 
actgcagaga gtgctgatac 7980 

8001 
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<210> 9 
<211> 11076 
<212> DNA 

<213> Hepatitis C Virus 
<400> 9 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtgctctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccgcg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc ragccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg agctctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcgcctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaacrgca ggacgaggca gcgcggctat- 600 
cgtggctggc cacgacgggc gttccttgcg cagctgrgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagoga aacatcgcat cgagcgagca cgtactcgga 84 0 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt: cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcptct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgag.tg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta trcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatgggcac gaatcctaaa cctcaaagaa aaaccaaacg taacaccaac 1860 
cgccgcccac aggacgtcaa gttcccgggc ggtggtcaga tcgtcggtgg agtttacctg 1920 
ttgccgcgca ggggccccag gttgggtgtg cgcgcgacta ggaagacttc cgagcggtcg 1980 
caacctcgtg gaaggcgaca acctatcccc aaggctcgcc agcccgaggg tagggcctgg 2040 
gctcagcccg ggtacccctg gcccctctat ggcaatgagg gcttggggtg ggcaggatgg 2100 
ctcctgtcac cccgtggctc tcggcctagt tggggcccca cggacccccg gcgtaggtcg 2160 
cgcaatttgg gtaaggtcat cgataccctc acgtgcggct tcgccgatct catggggtac 2220 
attccgctcg tcggcgcccc cctagggggc gctgccaggg ccctggcgca tggcgtccgg 2280 
gttctggagg. acggcgtgaa ctatgcaaca gggaatctgc ccggttgctc cttttctatc 2340 
ttccttttgg ctttgctgtc ctgtttgacc'atcccagctt ccgcttatga agtgcgcaac 2400 
gtatccggag tgtaccatgt cacgaacgac tgctccaacg caagcattgt gtatgaggca 2460 
gcggacatga tcatgcatac ccccgggtgc gtgccctgcg ttcgggagaa caactcctcc 2520 
cgctgctggg tagcgctcac tcccacgctc gcggccagga acgctagcgt ccccactacg 2580 
acgatacgac gccatgtcga tttgctcgtt ggggcggctg ctctctgctc cgctatgtac 2640 
gtgggagatc tctgcggatc tgttttcctc gtcgcccagc tgttcacctt ctcgcctcgc 2700 
cggcacgaga cagtacagga ctgcaattgc tcaatatatc ccggccacgt gacaggtcac 2760 
cgtatggctt gggatatgat gatgaactgg tcacctacag cagccctagt ggtatcgcag 2820 
ttactccgga tcccacaagc tgtcgtggat atggtggcgg gggcccattg gggagtccta 2880 
gcgggccttg cctactattc catggtgggg aactgggcta aggttctgat tgtgatgcta 2940 
ctctttgccg gcgttgacgg gggaacctat gtgacagggg ggacgatggc caaaaacacc 3000 
ctcgggatta cgtccctctt ttcacccggg tcatcccaga aaatccagct tgtaaacacc 3060 
aacggcagct ggcacatcaa caggactgcc ctgaactgca atgactccct caacactggg 3120 
ttccttgctg cgctgttcta cgtgcacaag ttcaactcat ctggatgccc agagcgcatg 3180 
gccagctgca gccccatcga cgcgttcgct caggggtggg ggcccatcac ttacaatgag 324 0 
tcacacagct cggaccagag gccttattgt tggcactacg caccccggcc gtgcggtatc 3300 
gtacccgcgg cgcaggtgtg tggtccagtg tactgcttca ccccaagccc tgtcgtggtg 3360 
gggacgaccg accggttcgg cgtccctacg tacagttggg gggagaatga gacggacgtg 3420 
ctgcttctta acaacacgcg gccgccgcaa ggcaactggt ttggctgtac atggatgaat 3480 
agcactgggt tcaccaagac gtgcgggggc cccccgtgta acatcggggg gatcggcaat 3540 
aaaaccttga cctgccccac ggactgcttc cggaagcacc ccgaggccac ttacaccaag 3600 
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tgtggttcgg ggccttggtt gacacccaga 
cactacccct gcactgtcaa ctttaccatc 
gagcacaggc tcgaagccgc atgcaattgg 
agggacagat cagagcttag cccgctgctg 
tgttccttca ccaccctacc ggctcrgtcc 
gtggacgtac aatacctgta cggtaraggg 
gagtatgtcc tgttgctctt ccttcrtctg 
atgatgctgc tgatagctca agctgaggcc 
gcatccgtgg ccggggcgca tggcattctc 
tacatcaagg gcaggctggt ccctggggcg 
ctcctgcrcc tgctggcgtt accaccacga 
tcgtgcggag gcgcggtttt cgtaggtctg 
ctgttcctcg ctaggctcat atggtggtta 
ttgcaagtgt ggatcccccc cctcaacgtt 
acgtgcgcga- tccacccaga gctaatcttt 
ggtccactca tggtgctcca ggc^ggtata 
gggctcattc gtgcatgcat gctggtgcgg 
gctctcatga agttggccgc actgacaggt 
cgggactggg cccacgcggg cctacgagac 
tctgatatgg agaccaaggt tatcacctgg 
atcrtgggcc tgcccgtctc cgcccgcagg 
agccttgaag ggcaggggtg gcgactcctc 
cgaggcctac ttggctgcat catcactagc 
ggggaggtcc aagtggtctc caccgcaaca 
gtgtgttgga ctgtctatca tggtgccggc 
atcacccaaa tgtacaccaa tgtggaccag 
gcgcgttcct tgacaccatg cacctgcggc 
gccgatgtca ttccggtgcg ccggcggggc 
cccgtctcct acttgaaggg ctcttcgggc 
gtgggcatct ttcgggctgc cgtgtgcacc 
cccgtcgagt ctatggaaac cactatgcgg 
ccggccgtac cgcagacatt ccaggtggcc 
agcactaagg tgccggctgc gtatgcagcc 
tccgtcgccg ccaccctagg tttcggggcg 
aacatcagaa tcggggtaag gaccatcacc 
ggcaagtttc ttgccgacgg tggttgctct 
gagtgccact caactgactc gaccactatc 
gagacggctg gagcgcgact cgtcgtgctc 
gtgccacatc caaacatcga ggaggtggct 
ggcaaagcca tccccatcga gaccatcaag 
aagaagaaat gtgatgagct cgccgcgaag 
tattaccggg gccttgargt. atccgtcata 
acggacgctc taatgacggg ctttaccggt 
tgtgtcaccc agacagtcga cttcagcctg 
gtgccacaag acgcggtgtc acgctcgcag 
ggcatttaca ggtttgtgac tccaggagaa 
ctgtgcgagt gctatgacgc ggg'ctgtgct 
gttaggttgc gggcttacct aaacacacca 
ttctgggaga gcgtctttac aggcctcacc 
aagcaggcag gagacaactt cccctacctg 
gctcaggctc cacctccatc gtgggaccaa 
acgctgcacg ggccaacgcc cctgctgtat 
accacacacc ccataaccaa atacatcatg 
acgagcacct gggtgctggt aggcggagtc 
acaggcagcg tggtcattgt gggcaggatc 
gacagggaag tcctttaccg ggagttcgat 
tacatcgaac agggaatgca gctcgccgaa 
caaacagcca ccaagcaagc ggaggctgct 
atcgaagcct tctgggcgaa gcatatgtgg 
ggcttgtcca ctctgcctgg caaccccgcg 
atcaccagcc cgctcaccac ccaacatacc 
gccgcccaac ttgctcctcc cagcgctgct 
gcggctgttg gcagcatagg ccttgggaag 
gcaggggtgg caggcgcgct cgtggccttt 
gaggacctgg ttaacctact ccctgctatc 
gtgtgcgcag cgatactgcg tcggcacgtg 
aaccggctga tagcgttcgc ttcgcggggt 



tgcttggtcc actacccata caggctttgg 3660 
ttcaaggtta ggatgtacgt ggggggagtg 3720 
actcgaggag agcgttgtaa cctggaggac 3780 
ctgtctacaa cggagtcgca ggtattgccc 3840 
actggtttga tccatctcca tcagaacgtc 3900 
tcggcggttg tctcctttgc aatcaaatgg 3960 
gcggacgcgc gcgtctgtgc ctgcttgtgg 4020 
gccctagaga acctggtggr cctcaacgcg 4080 
tccttcctcg tgttcttctg rgctgcctgg 4140 
gcatatgccc tctacggcgt atggccgcta 4200 
gcatacgcca tggaccggga gatggcagca 4260 
atacicttga ccttgtcacc gcactataag 4320 
caatatttta tcaccagggc cgaggcacac 4380 
cgggggggcc gcgatgccgt catcctcctc 4440 
accatcacca aaatcttgct cgccatactc 4500 
accaaagtgc cgtacttcgt gcgcgcacac 4560 
aaggttgctg ggggtcatta tgtccaaatg 4620 
acgtacgttt atgaccatct caccccactg 4680 
cttgcggtgg cagttgagcc cgtcgtcttc 4740 
ggggcagaca ccgcggcgtg tggggacatc 4800 
gggagggaga tacatctggg accggcagac 4860 
gcgcctatta cggcctactc ccaacagacg 4920 
ctcacaggcc gggacaggaa ccaggtcgag 4980 
caatctttcc tggcgacctg cgtcaatggc 5040 
tcaaagaccc ttgccggccc aaagggccca 5100 
gacctcgtcg gctggcaagc gccccccggg 5160 
agctcggacc tttacttggt cacgaggcat 5220 
gacagcaggg ggagcctact ctcccccagg 5280 
ggtccactgc tctgcccctc ggggcacgct 5340 
cgaggggttg cgaaggcggt ggactttgta 5400 
tccccggtct tcacggacaa ctcgtcccct 54 60 
catctacacg cccctactgg tagcggcaag 5520 
caagggtata aggtgcttgt cctgaacccg 5580 
tatatgtcta aggcacatgg tatrcgaccct 5640 
acgggtgccc ccatcacgta ctccacctat 5700 
gggggcgcct atgacatcat aatatgtgat 5760 
ctgggcatcg gcacagtcct ggaccaagcg 5820 
gccaccgcta cgcctccggg atcggrcacc 5880 
ctgtccagca ctggagaaat ccccttttat 5940 
ggggggaggc acctcatttt ctgccattcc 6000 
ctgtccggcc tcggactcaa tgctgtagca 6060 
ccaactagcg gagacgtcat tgtcgtagca 6120 
gacttcgact cagtgatcga ctgcaataca 6180 
gacccgacct tcaccattga gacgacgacc 6240 
cggcgaggca ggactggtag gggcaggatg 6300 
cggccctcgg gcatgttcga ttcctcggtt 6360 
"tggtacgagc tcacgcccgc cgagacctca 6420 
gggttgcccg tctgccagga ccatctggag 6480 
cacatagacg cccatttctt gtcccagact 6540 
gtagcatacc aggctacggt gtgcgccagg 6600 
atgtggaagt gtctcatacg gctaaagcct 6660 
aggctgggag ccgttcaaaa cgaggttact 6720 
gcatgcatgt cggctgacct ggaggtcgtc 6780 
ctagcagctc tggccgcgta ttgcctgaca 68 4 0 
atcttgtccg gaaagccggc catcattccc 6900 
gagatggaag agtgcgcctc acacctccct 6960 
caattcaaac agaaggcaat cgggttgctg 7020 
gctcccgtgg tggaatccaa gtggcggacc 7080 
aatttcatca gcgggataca atatttagca 7140 
atagcatcac tgatggcatt cacagcctct 7200 
ctcctgttta acatcctggg gggatgggtg 7260 
tctgctttcg taggcgccgg catcgctgga 7320 
gtgcttgtgg atattttggc aggttatgga 7380 
aaggtcatga gcggcgagat gccctccacc 7 4 40 
ctctcccctg gcgccctagt cgtcggggtc 7500 
ggcccagggg agggggctgt gcagtggatg 7560 
aaccacgtct cccccacgca ctatgtgcct 7620 
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gagagcgacg ctgcagcacg tgtcactcag atcctctcta gtcttaccat cactcagctg 7680 
ctgaagaggc ttcaccagtg gatcaacgag gactgctcca cgccatgctc cggctcgtgg 7740 
ctaagagatg tttgggattg gatatgcacg gtgttgactg atttcaagac ctggctccag 7800 
tccaagctcc tgccgcgatt gccgggagtc cccttcttct catgtcaacg tgggtacaag 7860 
ggagtctggc ggggcgacgg catcatgcaa accacctgcc catgtggggc acagatcacc 7920 
ggacatgtga aaaacggttc catgaggatc gtggggccta ggacctgtag taacacgtgg 7980 
catggaacat tccccattaa cgcgtacacc acgggcccct gcacgccctc cccggcgcca 8040 
aattattcta gggcgctgtg gcgggtggct gctgaggagt acgtggaggt tacgcgggtg 8100 
ggggatttcc actacgtgac gggcatgacc actgacgacg taaagtgccc gtgtcaggtt 6160 
ccggcccccg aattcttcac agaagtggat ggggtgcggt tgcacaggta cgctccagcg 8220 
tgcaaacccc tcctacggga ggaggtcaca ttcctggtcg ggctcaatca atacctggtt 8280 
gggtcacagc tcccatgcga gcccgaaccg gatgtagcag tgctcacttc catgcrcacc 8340 t 
gacccctccc acattacggc ggagacggct aagcgtaggc tggccagggg atctcctccc 8400 
cccttggcca gctcatcagc tagccagctg tctgcgcctt ccttgaaggc aacatgcact 8460 
acccgtcatg actccccgga cgctgacctc atcgaggcca acctcctgtg gcggcaggag 8520 
atgggcggga acatcacccg cgtggagtca gaaaataagg tagtaatttt ggactctttc 8580 
gagccgctcc aagcggagga ggatgagagg gaagtatccg ttccggcgga gatcctgcgg 8640 
aggtccagga aattccctcg agcgatgccc atatgggcac gcccggatta caaccctcca 8700 
ctgttagagt cctggaagga cccggactac gtccctccag tggtacacgg gtgtccattg 8760 
ccgcctgcca aggcccctcc gataccacct tcacggagga agaggacggt tgtcctgtca 8820 
gaatctaccg tgtcttctgc cttggcggag ctcgccacag agaccttcgg cagctccgaa 8880 
tcgtcggccg tcgacagcgg cacggcaacg gcctctcctg accagccctc cgacgacggc 8940 
gacgcgggat ccgacgtcga gtcgtactcc tccatgcccc cccttgaggg ggagccgggg 9000 
gatcccgatc tcagcgacgg gtcttggtct accgtaagcg aggaggctag tgaggacgtc 9060 
gtctgctgct cgatgtccta cacatggaca ggcgccctga tcacgccatg cgctgcggag 9120 
gaaaccaagc tgcccatcaa tgcactgagc aactctttgc tccgtcacca caacttggtc 9180 
tatgctacaa catctcgcag cgcaagcctg cggcagaaga aggtcacctt tgacagactg 924 0 
caggtcctgg acgaccacta ccgggacgtg ctcaaggaga tgaaggcgaa ggcgtccaca 9300 
gttaaggcta aacttctatc cgtggaggaa gcctgtaagc" tgacgccccc acattcggcc 9360 
agatctaaat ttggctatgg ggcaaaggac gtccggaacc tatccagcaa ggccgttaac 9420 
cacatccgct ccgtgtggaa ggacttgctg gaagacactg agacaccaat tgacaccacc 94 80 
atcatggcaa aaaatgaggt tttctgcgtc caaccagaga aggggggccg caagccagct 9540 
cgccttatcg tattcccaga tttgggggtt cgtgtgtgcg agaaaatggc cctttacgat 9600 
gtggtctcca ccctccctca ggccgrgatg ggctcttcat acggattcca atactctcct 9660 
ggacagcggg tcgagttcct ggtgaatgcc tggaaagcga agaaatgccc tatgggcttc 9720 
gcatatgaca cccgctgttt tgactcaacg gtcactgaga atgacatccg tgttgaggag 9780 
tcaatctacc aatgttgtga cttggccccc gaagccagac aggccataag gtcgctcaca 9340 
gagcggcttt acatcggggg ccccctgact aattctaaag ggcagaactg cggctatcgc 9900 
cggtgccgcg cgagcggtgt actgacgacc agctgcggta ataccctcac atgttacttg 9960 
aaggccgctg cggcctgtcg agctgcgaag ctccaggact gcacgatgct cgtatgcgga 10020 
gacgaccttg tcgttatctg tgaaagcgcg gggacccaag aggacgaggc gagcctacgg 10080 
gccttcacgg aggctatgac tagatactct gccccccctg gggacccgcc caaaccagaa 10140 
tacgacttgg agttgataac atcatgctcc tccaatgtgt cagtcgcgca cgatgcatct 10200 
ggcaaaaggg tgtactatct cacccgtgac cccaccaccc cccttgcgcg ggctgcgtgg 102 60 
gagacagcta gacacactcc agtcaattcc tggctaggca acatcatcat gtatgcgccc 10320 
accttgtggg caaggatgat cctgatgact catttcttct ccatccttct agctcaggaa 10380 
caacttgaaa aagccctaga ttgtcagatc w tacggggcct gttactccat tgagccactt 10440 
gacctacctc agatcattca acgactccat ggccttagcg cattttcact ccatagttac 10500 
tctccaggtg agatcaatag ggtggcttca tgcctcagga aacttggggt accgcccttg 10560 
cgagtctgga gacatcgggc cagaagtgtc cgcgctaggc tactgtccca gggggggagg 10620 
gctgccactt gtggcaagta cctcttcaac tgggcagtaa ggaccaagct caaactcact 10680 
ccaatcccgg ctgcgtccca gttggattta tccagctggt tcgttgctgg ttacagcggg 10740 
ggagacatat atcacagcct gtctcgtgcc cgaccccgct ggttcatgtg gtgcctactc 10800 
ctactttctg taggggtagg catctatcta ctccccaacc gatgaacggg gagctaaaca 10860 
ctccaggcca ataggccatc ctgttttttt cccttttttt ttttcttttt tttttttttt 10920 
tttttttttt tttttttttc tccttttttt ttcctctttt tttccttttc tttcctttgg 10980 
tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc 11040 
agagagtgct gatactggcc tctctgcaga tcaagt 11076 
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<210> 10 
<211> 8001 
<212> DNA 

<213> Hepatitis C Virus 
<400> 10 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 4 80 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga azgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgctgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagega aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc . tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc cgcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatggcgcc tattacggcc tactcccaac agacgcgagg cctacttggc I860 
tgcatcatca ctagcctcac aggccgggac aggaaccagg tcgaggggga ggtccaagtg 1920 
gtctccaccg caacacaatc tttcctggcg acctgcgtca atggcgtgtg ttggactgtc 1980 
tatcatggtg ccggctcaaa gacccttgcc ggcccaaagg gcccaatcac ccaaatgtac 2040 
accaatgtgg accaggacct cgtcggctgg caagcgcccc ccggggcgcg ttccttgaca 2100 
ccatgcacct gcggcagctc ggacctttac ttggtcacga ggcatgccga tgtcattccg 2160 
gtgcgccggc ggggcgacag cagggggagc ctactctccc ccaggcccgt ctcctacttg 2220 
aagggctctt cgggcggtcc actgctctgc ccctcggggc acgctgtggg catctttcgg 2280 
gctgccgtgt gcacccgagg ggttgcgaag gcggtggact ttgtacccgt cgagtctatg 2340 
ggaaccacta tgcggtcccc ggtcttcacg"gacaactcgt cccctccggc cgtaccgcag 2400 
acattccagg tggcccatct acacgcccct actggtagcg gcaagagcac taaggtgccg 2460 
gctgcgtatg caggccaagg gtataaggtg cttgtcctga acccgtccgt cgccgccacc 2520 
ctaggtttcg gggcgtatat gtctaaggca catggtatcg accctaacat cagaatcggg 2580 
gtaaggacca tcaccacggg tgcccccatc acgtactcca cctatggcaa gtttcttgcc 2640 
gacggtggtt gctctggggg cgcctatgac atcataatat gtgatgagtg ccactcaact 2700 
gactcgacca ctatcctggg catcggcaca gtcctggacc aagcggagac ggctggagcg 2760 
cgactcgtcg tgctcgccac cgctacgcct ccgggatcgg tcaccgtgcc acatccaaac 2820 
atcgaggagg tggctctgtc cagcactgga gaaatcccct tttatggcaa agccatcccc 2880 
atcgagacca tcaagggggg gaggcacctc attttctgcc attccaagaa gaaatgtgat 2940 
gagctcgccg cgaagctgtc cggcctcgga ctcaatgctg tagcatatta ccggggcctt 3000 
gatgtatccg tcataccaac tagcggagac gtcattgtcg tagcaacgga cgctctaatg 3060 
acgggcttta ccggcgactt cgactcagtg atcgactgca atacatgtgt cacccagaca 3120 
gtcgacttca gcctggaccc gaccttcacc attgagacga cgaccgtgcc acaagacgcg 3180 
gtgtcacgct cgcagcggcg aggcaggact ggtaggggca ggatgggcat ttacaggttt 3240 
gtgactccag gagaacggcc ctcgggcatg ttcgattcct cggttctgtg cgagtgctat 3300 
gacgcgggct gtgcttggta cgagctcacg cccgccgaga cctcagttag gttgcgggct 3360 
tacctaaaca caccagggtt gcccgtctgc caggaccatc tggagttctg ggagagcgtc 34 20 
tttacaggcc tcacccacat agacgcccat ttcttgtccc agactaagca ggcaggagac 3480 
aacttcccct acctggtagc ataccaggct acggtgtgcg ccagggctca ggctccacct 3540 
ccatcgtggg accaaatgtg gaagtgtctc atacggctaa agcctacgct gcacgggcca 3600 
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acgcccctgc tgtataggct gggagccgtt caaaacgagg ttactaccac acaccccata 3660 
accaaataca tcatggcatg catgtcggct gacctggagg tcgtcacgag cacctgggtg 3720 
ctggtaggcg gagtcctagc agctctggcc gcgtattgcc tgacaacagg cagcgtggtc 3780 
attgtgggca ggatcatctt gtccggaaag ccggccatca ttcccgacag ggaagtcctt 3840 
taccgggagt tcgatgagat ggaagagtgc gcctcacacc tcccttacat cgaacaggga 3900 
atgcagctcg ccgaacaatt caaacagaag gcaatcgggt tgctgcaaac agccaccaag 3960 
caagcggagg ctgctgctcc cgtggtggaa tccaagtggc ggaccatcga agccttctgg 4020 
gcgaagcata tgtggaattt catcagcggg atacaatatt tagcaggctr gtccactctg 4080 
cctggcaacc ccgcgatagc atcactgatg gcattcacag cctctatcac cagcccgctc 4140 
accacccaac ataccctcct gt-taacatc ctggggggat gggtggccgc ccaacttgct 4200 
cctcccagcg ctgcttctgo tttcgtaggc gccggcatcg ctggagcggc tgttggcagc 4260 
ataggccttg ggaaggtgct tgtggatatt ttggcaggtt atggagcagg ggtggcaggc 4320 
gcgctcgtgg cctt-caaggt catgagcggc gagttgccct ccaccgagga cctgg;ttaac 4380 
ctactccctg ctatcctctc ccctggcgcc ctagtcgtcg gggtcgtgtg cgcagcgata 4440 
ctgcgtcggc acgtgggccc aggggagggg gctgtgcagt ggatgaaccg gctgatagcg 4500 
ttcgcttcgc ggggtaacca cgtctccccc acgcacratg tgcctgagag cgacgctgca 4560 
gcacgagtca ctcagatcct c~ctagtctt accatcactc agctgctgaa caggcttcac 4620 
cagtggatca acgaggactg ctccacgcca tgctccggct cgtggctaag agatgtttgg 4680 
gattggatat gcacggtgtt gactgatttc aagacctggc tccagtccaa gctcctgccg 4740 
cgattgccgg " gagtcccctt cttctcatgt caacgtgggt acaagggagt ctggcggggc 4 800 
gacggcatca tgcaaaccac ctgcccatgt ggagcacaga tcaccggaca tgtgaaaaac 4860 
ggttccatga ggatcgtggg gcctaggacc tgtagtaaca cgtggcatgg aacattcccc 4920 
attaacgcgt acaccacggg cccctgcacg ccctccccgg cgccaaatta ttctagggcg 4980 
ctgtggcggg taggtgctga ggagtacgtg gaggttacgc gggtggggga tttccactac 5040 
gtgacgggca tgaccactga caacgtaaag tgcccgtgtc aggttccggc ccccgaattc 5100 
ttcacagaag rggatggggt gcggttgcac aggtacgctc cagcgtgcaa acccctccta 5160 
cgggaggagg tcacattcct ggtcgggctc aatcaatacc tggttgggtc acagctccca 5220 
tgcgagcctg aaccggatgt agcagtgctc acttccatgc tcaccgaccc ctcccacatt 5280 
acggcggaga cggctaagcg taggctggcc aggggatctc cccccccctt ggccagctca 5340 
tcagctagcc agctgtctgc gccttccttg aaggcaacat gcactacccg tcatgactcc 5400 
ccggacgctg acctcatcga ggccaacctc ctgtggcggc aggagatggg cgggaacatc 54 60 
acccgcgtgg agtcagaaaa taaggtagta attttggact ctttcgagcc gctccaagcg 5520 
gaggaggatg agaggggagt atccgttccg gcggagatcc tgcggaggtc caggaaattc 5580 
cctcgagcga tgcccatatg ggcacgcccg gattacaacc ctccactgtt agagtcctgg 5640 
aaggacccgg actacgtccc tccagtggta cacgggtgtc cattgccgcc tgccaaggcc 5700 
cctccgatac caccttcacg gaggaagagg acggttgtcc tgtcagaatc taccgtgtct 5760 
tctgccttgg cggagctcgc cacagagacc ttcggcagct ccgaatcgtc ggccgtcgac 5820 
agcggcacgg caacggcctc tcctgaccag ccctccgacg acggcgacgc gggatccgac 5880 
gttgagtcgt actcctccat gccccccctt gagggggagc cgggggatcc cgatctcagc 594 0 
gacgggtctt ggtctaccgt aagcgaggag gctagtgagg acgtcgtctg ctgctcgatg 6000 
tcctacacat ggacaggcgc cctgatcacg ccatgcgctg cggaggaaac caagctgccc 6060 
atcaatgcac tgagcaactc tttgctccgt caccacaact tggtctatgc tacaacatct 6120 
cgcagcgcaa acctgcggca gaagaaggtc acctttgaca gactgcaggt cctggacgac 6180 
cactaccggg acgtgctcaa ggagatgaag gcgaaggcgt ccacagttaa ggctaaactt 6240 
ctatccgtgg aggaagcctg taagctgacg cccccacatt cggccagatc taaatttggc 6300 
targgggcaa aggacgtccg gaacctatcc agcaaggccg ttaaccacat ccgctccgtg 6360 
tggaaggact tgctggaaga cactgagaca " ccaattgaca ccaccatcat ggcaaaaaat 6420 
gaggttttct gcgtccaacc agagaagggg ggccgcaagc cagctcgcct tatcgtattc 6480 
ccagatttgg gggttcgtgt gtgcgagaaa atggcccttt acgatgtggt ctccaccctc 6540 
cctcaggccg tgatgggctc ttcatacgga ttccaatact ctcctggaca gcgggtcgag 6600 
ttcctggtga atgcctggaa agcgaagaaa tgccctatgg gcttcgcata tgacacccgc 6660 
tgttttgact caacggtcac tgagaatgac atccgtgttg aggagtcaat ctaccaatgt 6720 
tgtgacttgg cccccgaagc cagacaggcc ataaggtcgc tcacagagcg gctttacatc 6780 
gggggccccc tgactaattc taaagggcag aactgcggct atcgccggtg ccgcgcgagc 6840 
ggtgtactga cgaccagctg cggtaatacc ctcacatgtt acttgaaggc cgctgcggcc 6900 
tgtcgagctg cgaagctcca ggactgcacg atgctcgtat gcggagacga ccttgtcgtt 6960 
atctgtgaaa gcgcggggac ccaagaggac gaggcgagcc tacgggcctt cacggaggct 7020 
atgactagat actctgcccc ccctggggac ccgcccaaac cagaatacga cttggagttg 7080 
ataacatcat gctcctccaa tgtgtcagtc gcgcacgatg catctggcaa aagggtgtac 714 0 
tatctcaccc gtgaccccac cacccccctt gcgcgggctg cgtgggagac agctagacac 7200 
actccagtca attcctggct aggcaacatc atcatgtatg cgcccacctt gtgggcaagg 72 60 
atgatcctga tgactcattt cttctccatc cttctagctc aggaacaact tgaaaaagcc 7320 
ctagattgtc agatctacgg ggcctgttac tccattgagc cacttgacct acctcagatc 7380 
attcaacgac tccatggcct tagcgcattt tcactccata gttactctcc aggtgagatc 7440 
aatagggtgg cttcatgcct caggaaactt ggggtaccgc ccttgcgagt ctggagacat 7500 
cgggccagaa gtgtccgcgc taggctactg tcccaggggg ggagggctgc cacttgtggc 75 60 
aagtacctct tcaactgggc agtaaggacc aagctcaaac tcactccaat cccggctgcg 7620 
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tcccagttgg atttatccag ctggttcgtt 
agcctgtctc gtgcccgacc ccgctggttc 
gtaggcatct atctactccc caaccgatga 
ccatcctgtt tttttccctt tttttttttc 
ttttctcctt tttttttcct ctttttttcc 
cctagtcacg gcragctgtg aaaggtccgt 
tggcctctct gcagatcaag t 



gctggttaca gcgggggaga catatatcac 7680 
atgtggtgcc tactcctact ttctgtaggg 7740 
acggggagct aaacactcca ggccaatagg 7800 
tttttttttt tttttttttt tttttttttt 7860 
ttttctttcc tttggtggct ccatcttagc 7920 
gagccgcttg actgcagaga gtgctgatac 7980 

eooi 
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<210> 11 
<211> 11076 
<212> DNA 

<213> Hepatitis C Virus 
<400> 11 

gccagccccc gattgggggs gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtcccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
cccaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgcr tgggtggaga ggctattcgg ctatgactgg gcacaacaga 4 30 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgcgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatgggcac gaatcctaaa cctcaaagaa aaaccaaacg taacaccaac 1860 
cgccgcccac aggacgtcaa gttcccgggc ggtggtcaga rcgtcggtgg agtttacctg 1920 
ttgccgcgca ggggccccag gttgggtgtg cgcgcgacta ggaagacttc cgagcggtcg 1980 
caacctcgtg gaaggcgaca acctatcccc aaggctcgcc agcccgaggg tagggcctgg 2040 
gctcagcccg ggtacccctg gcccctctat ggcaatgagg gcttggggtg ggcaggatgg 2100 
ctcctgtcac cccgtggctc tcggcctagt tggggcccca cggacccccg gcgtaggtcg 2160 
cgcaatttgg gtaaggtcat cgataccctc acgtgcggct tcgccgatct catggggtac 2220 
attccgctcg tcggcgcccc cctagggggc gctgccaggg ccctggcgca tggcgtccgg 2280 
gttctggagg acggcgtgaa ctatgcaaca gggaatctgc ccggttgctc cttttctatc 2340 
ttccttttgg ctttgctgtc ctgtttgacc 'atcccagctt ccgcttatga agtgcgcaac 2400 
gtatccggag tgtaccatgt cacgaacgac tgctccaacg caagcattgt gtatgaggca 24 60 
gcggacatga tcatgcatac ccccgggtgc gtgccctgcg ttcgggagaa caactcctcc 2520 
cgctgctggg tagcgctcac tcccacgctc gcggccagga acgctagcgt ccccactacg 2580 
acgatacgac gccatgtcga tttgctcgtt ggggcggctg ctctctgctc cgctatgtac 264 0 
gtgggagatc tctgcggatc tgttttcctc gtcgcccagc tgttcacctt ctcgcctcgc 2700 
cggcacgaga cagtacagga ctgcaattgc tcaatatatc ccggccacgt gacaggtcac 2760 
cgtatggctt gggatatgat gatgaactgg tcacctacag cagccctagt ggtatcgcag 2820 
ttactccgga tcccacaagc tgtcgtggat atggtggcgg gggcccattg gggagtccta 2880 
gcgggccttg cctactattc catggtgggg aactgggcta aggttctgat tgtgatgcta 2940 
ctctttgccg gcgttgacgg gggaacctat gtgacagggg ggacgatggc caaaaacacc 3000 
ctcgggatta cgtccctctt ttcacccggg tcatcccaga aaatccagct tgtaaacacc 3060 
aacggcagct ggcacatcaa caggactgcc ctgaactgca atgactccct caacactggg 3120 
ttccttgctg cgctgttcta cgtgcacaag ttcaactcat ctggatgccc agagcgcatg 3180 
gccagctgca gccccatcga cgcgttcgct caggggtggg ggcccatcac ttacaatgag 3240 
tcacacagct cggaccagag gccttattgt tggcactacg caccccggcc gtgcggtatc 3300 
gtacccgcgg cgcaggtgtg tggtccagtg tactgcttca ccccaagccc tgtcgtggtg 3360 
gggacgaccg accggttcgg cgtccctacg tacagttggg gggagaatga gacggacgtg 3420 
ctgcttctta acaacacgcg gccgccgcaa ggcaactggt ttggctgtac atggatgaat 3480 
agcactgggt tcaccaagac gtgcgggggc cccccgtgta acatcggggg gatcggcaat 3540 
aaaaccttga cctgccccac ggactgcttc cggaagcacc ccgaggccac ttacaccaag 3600 
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tgtggttcgg ggccttggtt gacacccaga tgcttggtcc actacccata caggctttgg 3660 
cactacccct gcacrgtcaa ctttaccatc ttcaaggtta ggatgtacgt ggggggagtg 3720 
gagcacaggc tcgaagccgc atgcaattgg actcgaggag agcgttgtaa cctggaggac 3780 
agggacagat cagagcttag cccgctgctg ctgtctacaa cggagtggca ggtattgccc 3840 
tgttccttca ccaccctacc ggctctgtcc actggtttga tccatctcca tcagaacgtc 3900 
gtggacgtac aatacctgta cggtataggg tcggcggttg tctcctttgc aatcaaatgg 3960 
gagtatgtcc tgttgctctt ccttcttctg gcggacgcgc gcgtctgtgc ctgcttgtgg 4 020 
atgatgctgc tgatagctca agctgaggcc gccctagaga acctggtggt cctcaacgcg 4080 
gcatccgtgg ccggggcgca tggcattctc tccttcctcg tgttcttctg tgctgcctgg 4140 
tacatcaagg gcaggctggt ccctggggcg gcatatgccc tctacggcgt atggccgcta 4200 
ctcctgctcc tgctggcgtt accaccacga gcatacgcca tggaccggga gatggcagca 4260 
tcgtgcggag gcgcggtttt cgtaggtctg atactcttga ccttgtcacc gcactataag 4320 
ctgttcctcg ctaggctcat atggtggtta caatatttta tcaccagggc cgaggcacac 4380 
ttgcaagtgt ggatcccccc cctcaacgtt cgggggggcc gcgatgccgt catcctcctc 4440 
acgtgcgcga tccacccaga gctaatcttt accatcacca aaatcttgct cgccatactc 4500 
ggtccactca tggtgctcca ggctggtata accaaagtgc cgtacttcgt gcgcgcacac 4560 
gggctcattc gtgcatgcat gctggtgcgg aaggttgctg ggggtcatta tgtccaaatg 4620 
gctctcatga agttggccgc actgacaggt acgtacgttt atgaccatct caccccactg 4 680 
cgggactggg cccacgcggg cctacgagac cttgcggtgg cagttgagcc cgtcgtcttc 4740 
tctgatatgg agaccaaggt tatcacctgg ggggcagaca ccgcggcgtg tggggacatc 4800 
atcttgggcc tgcccgtctc cgcccgcagg gggagggaga tacatctggg accggcagac 4 8 60 
agccttgaag ggcaggggtg gcgactcctc gcgcctatta cggcctactc ccaacagacg 4 920 
cgaggcctac ttggctgcat catcactagc ctcacaggcc gggacaggaa ccaggtcgag 4 980 
ggggaggtcc aagtggtctc caccgcaaca caatctttcc tggcgacctg cgtcaatggc 504 0 
gtgtgttgga ctgtctatca tggtgccggc tcaaagaccc ttgccggccc aaagggccca 5100 
atcacccaaa tgtacaccaa tgtggaccag gacctcgtcg gctggcaagc gccccccggg 5160 
gcgcgttcct tgacaccatg cacctgcggc agctcggacc tttacttggt cacgaggcat 5220 
gccgatgtca ttccggtgcg ccggcggggc gacagcaggg ggagcctact ctcccccagg 5280 
cccgtctcct. acttgaaggg ctcttcgggc ggtccactgc tctgcccctc ggggcacgct .534 0 
gtgggcatct ttcgggctgc cgtgtgcacc cgaggggttg cgaaggcggt ggactttgta 5400 
cccgtcgagt ctatgggaac cactatgcgg tccccggtct tcacggacaa ctcgtcccct 54 60 
ccggccgtac cgcagacatt ccaggtggcc catctacacg cccctactgg tagcggcaag 5520 
agcactaagg tgccggctgc gtatgcaggc caagggtata aggtgcttgt cctgaacccg 5580 
tccgtcgccg ccaccctagg tttcggggcg tatatgtcta aggcacatgg tarcgaccct 5640 
aacatcagaa tcggggtaag gaccatcacc acgggtgccc ccatcacgta ctccacctat 5700 
ggcaagtttc ttgccgacgg tggttgctct gggggcgcct atgacatcat aatatgtgat 5760 
gagtgccact caactgactc gaccactatc ctgggcatcg gcacagtcct ggaccaagcg 5820 
gagacggctg gagcgcgact cgtcgtgctc gccaccgcta cgcctccggg atcggtcacc 5880 
gtgccacatc caaacatcga ggaggtggct ctgtccagca ctggagaaat ccccttttat 5940 
ggcaaagcca tccccatcga gaccatcaag ggggggaggc acctcatttt ctgccattcc 6000 
aagaagaaat gtgatgagct cgccgcgaag ctgtccggcc tcggactcaa tgctgtagca 6060 
tattaccggg gccttgatgt atccgtcata ccaactagcg gagacgtcat tgtcgtagca 6120 
acggacgctc taatgacggg ctttaccggc gacttcgact cagtgatcga ctgcaataca 6180 
tgtgtcaccc agacagtcga cttcagcctg gacccgacct tcaccattga gacgacgacc 6240 
gtgccacaag acgcggtgtc acgctcgcag cggcgaggca ggactggtag gggcaggatg 6300 
ggcatttaca ggtttgtgac tccaggagaa cggccctcgg gcatgttcga ttcctcggtt 6360 
ctgtgcgagt gctatgacgc gggctgtgcf'tggtacgagc tcacgcccgc cgagacctca 6420 
gttaggttgc gggcttacct aaacacacca gggttgcccg tctgccagga ccatctggag 6480 
ttctgggaga gcgtctttac aggcctcacc cacatagacg cccatttctt gtcccagact 654 0 
aagcaggcag gagacaactt cccctacctg gtagcatacc aggctacggt gtgcgccagg 6600 
gctcaggctc cacctccatc gtgggaccaa atgtggaagt gtctcatacg gctaaagcct 6660 
acgctgcacg ggccaacgcc cctgctgtat aggctgggag ccgttcaaaa cgaggttact 6720 
accacacacc ccataaccaa atacatcatg gcatgcatgt cggctgacct ggaggtcgtc 6780 
acgagcacct gggtgctggt aggcggagtc ctagcagctc tggccgcgta ttgcctgaca 6840 
acaggcagcg tggtcattgt gggcaggatc atcttgtccg gaaagccggc catcattccc 6900 
gacagggaag tcctttaccg ggagttcgat gagatggaag agtgcgcctc acacctccct 6960 
tacatcgaac agggaatgca gctcgccgaa caattcaaac agaaggcaat cgggttgctg 7020 
caaacagcca ccaagcaagc ggaggctgct gctcccgtgg tggaatccaa gtggcggacc 7080 
atcgaagcct tctgggcgaa gcatatgtgg aatttcatca gcgggataca atatttagca 7140 
ggcttgtcca ctctgcctgg caaccccgcg atagcatcac tgatggcatt cacagcctct 7200 
atcaccagcc cgctcaccac ccaacatacc ctcctgttta acatcctggg gggatgggtg 7260 
gccgcccaac ttgctcctcc cagcgctgct tctgctttcg taggcgccgg catcgctgga 7320 
gcggctgttg gcagcatagg ccttgggaag gtgcttgtgg atattttggc aggttatgga 7380 
gcaggggtgg caggcgcgct cgtggccttt aaggtcatga gcggcgagtt gccctccacc 7440 
gaggacctgg ttaacctact ccctgctatc ctctcccctg gcgccctagt cgtcggggtc 7500 
gtgtgcgcag cgatactgcg tcggcacgtg ggcccagggg agggggctgt gcagtggatg 7560 
aaccggctga tagcgttcgc ttcgcggggt aaccacgtct cccccacgca ctatgtgcct 7 620 
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gagagcgacg ctgcagcacg agtcactcag atcctctcta gtcttaccat cactcagctg 7680 
ctgaagaggc ttcaccagtg gatcaacgag gactgctcca cgccatgctc cggctcgtgg 7740 
ctaagagatg tttgggattg gatatgcacg gtgttgactg atttcaagac ctggctccag 7800 
tccaacctcc tgccgcgatt gccgggagtc cccttcttct catgtcaacg tgggtacaag 7860 
ggagtctggc ggggcgacgg catcatgcaa accacctgcc catgtggagc acagatcacc 7920 
ggacatgtga aaaacggttc catgaggatc gtggggccta ggacctgtag taacacgtgg 7980 
catggaacat tccccattaa cgcgtacacc acgggcccct gcacgccctc cccggcgcca 8040 
aattattcta gggcgctgtg gcgggtaggt gctgaggagt acgtggaggt tacgcgggtg 8100 
ggggatttcc actacgtgac gggcatgacc actgacaacg taaagtgccc gtgtcaggtt 8160 
ccggcccccg aattcttcac agaagtggat ggggtgcggt tgcacaggta cgctccagcg 8220 
tccaaacccc tcctacggga ggaggtcaca ttcctggtcg ggctcaatca atacctggtt 8280 
gggtcacagc tcccatgcga gcctgaaccg gatgtagcag tgctcacttc catgctcacc 8340 
gacccctccc acattacggc ggagacggct aagcgtaggc tggccagggg atctcccccc 84 00 
cccttggcca gctcatcagc tagccagctg tctgcgcctt ccttgaaggc aacatgcact 8460 
acccgtcatg actccccgga cgctgacctc atcgaggcca acctcctgtg gcggcaggag 8520 
atgggcggga acatcacccg cgtggagtca gaaaataagg tagtaatttt ggactctttc 8580 
gagccgctcc aaccggagga ggatgagagg ggagtatccg ttccggcgga gatcctgcgg 8640 
aggtccagga aattccctcg agcgatgccc atatgggcac gcccggatta caaccctcca 8700 
ctgttagagt ccrggaagga. cccggactac gtccctccag tggtacacgg gtgtccattg 8760 
ccgcctgcca aggcccctcc gataccacct tcacggagga agaggacggt tgtcctgtca 8820 
gaatctaccg tgtcttctgc cttggcggag c:cgccacag agaccttcgg cagctccgaa 8880 
tcgtcggccg tcgacagcgg cacggcaacg gcctctcctg accagccctc cgacgacggc 8 94 0 
gacgcgggat ccgacgttga gtcgtactcc tccatgcccc cccttgaggg ggagccgggg 9000 
gatcccgatc tcagcgacgg gtcttggtct accgtaagcg aggaggctag tgaggacgtc 9060 
gtctgctgcr cgatgtccta cacatggaca ggcgccctga tcacgccatg cgctgcggag 9120 
gaaaccaagc tgcccatcaa tgcactgagc aactctttgc tccgtcacca caacttggtc 9180 
tatgctacaa catctcgcag cgcaaacctg cggcagaaga aggtcacctt tgacagactg 9240 
caggtcctgg acgaccacta ccgggacgtg ctcaaggaga tgaaggcgaa ggcgtccaca 9300 
gttaaggcta aacttctatc cgtggaggaa gcctgtaagc tgacgccccc acattcggcc 9360 
agatctaaat ttggctatgg ggcaaaggac gtccggaacc tatccagcaa ggccgttaac 9420 
cacatccgct ccgtgtggaa ggacttgctg gaagacactg agacaccaat tgacaccacc 9480 
atcatggcaa aaaatgaggt tttctgcgtc caaccagaga aggggggccg caagccagct 9540 
cgccttatcg tattcccaga tttgggggtt cgtgtgtgcg agaaaatggc cctttacgat 9600 
gtggtctcca ccctccctca ggccgtgatg ggctcttcat acggattcca atactctcct 9660 
ggacagcggg tcgagttcct ggtgaatgcc tggaaagcga agaaatgccc tatgggcttc 9720 
gcatatgaca cccgctgttt tgactcaacg gtcactgaga atgacatccg tgttgaggag 9780 
tcaatctacc aatgttgtga cttggccccc gaagccagac aggccataag gtcgctcaca 9840 
gagcggcttt acatcggggg ccccctgact aattctaaag ggcagaactg cggctatcgc 9900 
cggtgccgcg cgagcggtgt actgacgacc agctgcggta ataccctcac atgttacttg 9960 
aaggccgctg cggcctgtcg agctgcgaag ctccaggact gcacgatgct cgtatgcgga 10020 
gacgaccttg tcgttatctg tgaaagcgcg gggacccaag aggacgaggc gagcctacgg 10080 
gccttcacgg aggctatgac tagatactct gccccccctg gggacccgcc caaaccagaa 10140 
tacgacttgg agttgataac atcatgctcc tccaatgtgt cagtcgcgca cgatgcatct 10200 
ggcaaaaggg tgtactatct cacccgtgac cccaccaccc cccttgcgcg ggctgcgtgg 10260 
gagacagcta gacacactcc agtcaattcc tggctaggca acatcatcat gtatgcgccc 10320 
accttgtggg caaggatgat cctgatgact catttcttct ccatccttct agctcaggaa 10380 
caacttgaaa aagccctaga ttgtcagatc - tacggggcct gttactccat tgagccactt 10440 
gacctacctc agatcattca acgactccat ggccttagcg cattttcact ccatagttac 10500 
tctccaggtg agatcaatag ggtggcttca tgcctcagga aacttggggt accgcccttg 105 60 
cgagtctgga gacatcgggc cagaagtgtc cgcgctaggc tactgtccca gggggggagg 10 620 
gctgccactt gtggcaagta cctcttcaac tgggcagtaa ggaccaagct caaactcact 10680 
ccaatcccgg ctgcgtccca gttggattta tccagctggt tcgttgctgg ttacagcggg 10740 
ggagacatat atcacagcct grctcgtgcc cgaccccgct ggttcatgtg gtgcctactc 10800 
ctactttctg taggggtagg catctatcta ctccccaacc gatgaacggg gagctaaaca 10860 
ctccaggcca ataggccatc ctgttttttt cccttttttt ttttcttttt tttttttttt 10920 
tttttttttt tttttttttc tccttttttt ttcctctttt tttccttttc tttcctttgg 10980 
tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc 1104 0 
agagagtgct gatactggcc tctctgcaga tcaagt 11076 
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CLAIMS 

A hepatitis C virus (HCV) cell culture system, which 
comprises mainly eukaryotic cells containing 
transfected HCV specific genetic material, 
characterized in that, 

the eukaryotic cells are human hepatoma cells and the 
transfected HCV specific genetic material is a HCV-RNA 
construct, which comprises the HCV specific RNA 
segments 5* NTR, NS3, NS4A, NS4B, NS5A, N35B, and 
3' NTR as well as an additional marker gene for 
selection (selection gene) . 

A cell culture system according to Claim 1, 
characterized in that, 

the hepatoma cells are derived from a usual hepatoma 
cell line. 

A cell culture system according to Claim 1, 
characterized in that, 

the hepatoma cells are extracted from a primary 
hepatoma cell culture. 

A HCV-RNA construct, characterized in that, 
it comprises the HCV specific RNA segments 5 1 NTR, 
NS3, NS4A, NS4B, NS5A, NS5B, and 3 1 NTR as well as an 
additional marker gene for selection (selection gene). 

A HCV-RNA construct according to Claim 4, 
characterized in that, 

it comprises one of the nucleotide sequences 
illustrated in the sequence protocols SEQ ID NO: 1 to 
SEQ ID NO: 11. 

A HCV-RNA construct according to Claim 4, 
characterized in that, 



CA 02303526 2000-03-31 



the 3' NTR has a nucleotide sequence, which has been 
selected from the group of nucleotide sequences (a) to 
(i) listed in the following: 

( a ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA 
GCTTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCC 
CAGACTGGAGT 

(b) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA 
GTCTTTTTTTTTTCTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCCCA 
GACTGGAGC 

( c ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTAAT 
CTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCCCA 
GACTGCAGC 

( d ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTT 
AGTCTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGC 
CCAGACTGGAGT 

( e ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA 
GTCTTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGC 
CCAGACTGGAGT 

( f ) ACGGGGAGCTAAACACTCCAGGCC AAT AGGCCATCCTGTTTTTTTTTTTA 
GTCTTTTTTTTTTTTCTTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTG 
CCCAGACTGGAGT 

(g) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA 
GTCTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCC 
CAGACTGGAGT 

( h ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTT 
TAATCTTTTTTTTTTTTTTCCTTTTTTTGAGAGAGAGAGTCTCACTCTGT 
TGCCCAGACTGGAGT 
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( i ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTT 
AATCTTTTTTTTTTTTTTCTTTTTTTTTTGAGAGAGAGAGTCTCACTCTG 
TTGCCCAGACTGGAGT 

5 7. A HCV-RNA construct according to one of the Claims 4 
to 6, characterized in that, 

the marker gene for selection is a resistance gene and 
preferably an antibiotic resistance gene, 

10 8. A HCV-RNA construct according to one of the Claims 4 
to 6, characterized in that, 

the marker gene for selection is a neomycin 
phosphotransferase gene . 

15 9. A HCV-RNA construct according to one of the Claims 4 
to 8, characterized in that, 

the marker gene for selection is integrated in the 
HCV-RNA downstream from the 5' NTR. 

20 10. A HCV-RNA construct according to one of the Claims 4 
to 83, characterized in that, 

the marker gene for selection is linked to the HCV-RNA 
via a ribozyme or a recognition sequence for a 
ribozyme . 

11. A HCV-RNA construct according to one of the Claims 4 
to 10, characterized in that, 
it has an integrated reporter gene. 

30 12. A HCV-RNA construct according to Claim 11, 
characterized in that, 

the reporter gene is a gene from the group of the 
luciferase genes, the CAT gene (chloramphenicol acetyl 
transferase gene) , the lacZ gene (beta galactosidase 
35 gene) , the GFP gene (green fluorescence protein gene) , 

the GUS gene (glucuronidase gene) and the SEAP gene 
(secreted alkaline phosphatase gene) . 
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13. A HCV-RNA construct according to one of the Claims 4 
to 11, characterized in that, 

their replication has an influence on the expression 
5 of a (cellular) surrogate marker gene. 

14. A HCV-RNA construct according to one of the Claims 11 
to 13, characterized in that, 

the resistance gene is cloned into the open reading 
10 frame of the HCV-RNA in such a way that it will only 

be transferred to an active form after proteolytic 
processing . 

15. A HCV-RNA construct according to one of the Claims 11 
15 to 14, characterized in that, 

the reporter gene and the marker gene for selection 
are arranged in the construct in such a way, that they 
are expressed as a fusion protein. 

20 16. A cell culture system according to one of the Claims 1 
to 3, characterized in that, 

the HCV-RNA construct is a construct according to at 
least one of the Claims 4 to 15. 

25 17. A cell culture system according to Claim 1, 
characterized in that, 

the cells containing the HCV-RNA construct are lodged 
at the DSMZ, Braunschweig, Germany under the deposit 
number DSM ACC2394 (laboratory name HuBl 9-13) . 



30 

18 



35 



A use of a cell culture system according to one of the 
Claims 1 to 3 or 16 to 17 and/or a HCV-RNA construct 
according to one of the Claims 4 to 15 for the 
production and/or evaluation and/or testing of 
treatments and/or diagnostics for the treatment of HCV 
infections in particular. 
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A use of a cell culture system according to one of the 
Claims 1 to 3 or 16 to 17 and/or a HCV-RNA construct 
according to one of the Claims 4 to 15 for the 
production of a vaccine against HCV infections. 

A use a HCV-RNA construct according to one of the 
Claims 4 to 15 for the production of a liver cell 
specific vector in gene therapy. 

A HCV-RNA construct according to one of the Claims 4 
to 15, characterized in that, 

it has an integrated foreign gene and can be used to 
transfect this foreign gene in a target cell, which 
can be used to express this foreign gene. 

A process for the production of cell culture adapted 
mutants of a HCV-RNA construct according to one of the 
claims 4 to 15, in which the mutants have increased 
replication efficiency compared to the HCV-RNA 
construct, 

characterized in that, 

a cell culture system according to claim 1, in which 
the transfected HCV specific genetic material is a 
HCV-RNA construct with a selection gene according to 
one of the claims 4 to 15, is cultivated on/in the 
selection medium corresponding to the selection gene, 
that the cell clones created are collected, and that 
the HCV-RNA constructs or parts thereof are isolated 
from these cell clones. 

A process according to claim 22, characterized in 
that, 

the isolated HCV-RNA constructs are passaged at least 
one more time . That is, they are transfected in cells 
of a cell culture system according to claim 1. The 
cell culture system of claim 1 thereby obtained, in 
which the transfected HCV specific genetic material is 
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the isolated HCV-RNA construct with a selection gene, 
is cultivated on/in the selection medium corresponding 
to the selection gene, the cell clones created are 
collected, and the HCV-RNA constructs are isolated 
from these cell clones, 

A process for the production of mutants of a HCV full- 
length genome or of a HCV subgenorae or any HCV 
construct with increased replication efficiency in 
comparison to the original HCV full-length genome or 
subgenome or HCV-RNA construct, 
characterized, in that, 

a cell culture adapted mutant of a HCV-RNA construct 
is produced and isolated by a process according to 
claim 22 or 23, 

the nucleotide and amino acid sequence of these 
mutants is determined and the type, number and 
positions of the nucleotide and amino acid mutations 
is determined by comparison with the nucleotide and 
amino acid sequence of the original HCV-RNA construct, 
and these mutations are introduced in an (isolated) 
HCV full-length genome or a HCV subgenome or any HCV- 
RNA construct, either by site-directed mutagenesis or 
by exchange of fragments containing the relevant 
mutations 

A cell culture adapted HCV-RNA construct with high 
replication efficiency, 
characterized in that, 

it is derivable from a HCV-RNA construct according to 
one of the claims 4 to 15 through nucleotide and/or 
amino acid mutations and that it is obtainable with a 
process according to one of the claims 22 to 24. 

A cell culture adapted HCV-RNA construct according to 
claim 25, 

characterized in that. 
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it carries one or several of the following amino acid 
exchanges: 1283 arg -> gly and/or 1383 glu -> ala 
and/or 1577 lys -> arg and/or 1609 lys -> glu and/or 
1936 pro -> ser and/or 2163 glu -> gly and/or 2330 lys 
-> glu and/or 2442 ile -> val. 

A cell culture adapted HCV-RNA construct according to 
claim 25 or 26, 
characterized in that, 

it displays one or several of the nucleotide and/or 
amino acid exchanges shown in Table 3, where Table 3 
is an integral part of this claim. 

Cell culture adapted mutant of a HCV-RNA construct or 
of a HCV full-length genome with increased replication 
efficiency in comparison to the original HCV-RNA 
construct or original HCV full-length genome, 
characterized in that, 

it is obtainable with a process in which the number 
and type of mutations in a cell culture adapted HCV- 
RNA construct according to claim 24 are determined 
through sequence analysis and sequence comparison, and 
these mutations are introduced in a HCV-RNA construct, 
particularly in a HCV-RNA construct according to one 
of the claims 4 to 15, or in an (isolated) HCV-RNA 
full-length genome, either by targeted mutagenesis or 
by exchange of sequence sections , containing the 
relevant mutations. 

Hepatitis C virus particles or virus-like particles, 
characterized in that, 

they are obtainable by a process according to one of 
the claims 22-24. 

Cells infected with Hepatitis C virus particles or 
virus-like particles, according to claim 29. 
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